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.9-1 

pIRESpuro2W/ hepEK_k 

1 GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC 

CTGCCTAGCC CTCTAGAGGG CTAGGGGATA CCAGCTGAGA GTCATGTTAG 
51 TGCTCTGATG CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT 

ACGAGACTAC GGCGTATCAA TTCGGTCATA GACGAGGGAC GAACACACAA 
101 GGAGGTCGCT GAGTAGTGCG CGAGCAAAAT TTAAGCTACA ACAAGGCAAG 

CCTCCAGCGA CTCATCACGC GCTCGTTTTA AATTCGATGT TGTTCCGTTC 
151 GCTTGACCGA CAATTGCATG AAGAATCTGC TTAGGGTTAG GCGTTTTGCG 

CGAACTGGCT GTTAACGTAC TTCTTAGACG AATCCCAATC CGCAAAACGC 
201 CTGCTTCGCG ATGTACGGGC CAGATATACG CGTTGACATT GATTATTGAC 

GACGAAGCGC TACATGCCCG GTCTATATGC GCAACTGTAA CTAATAACTG 
251 TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 

ATCAATAATT ATCATTAGTT AATGCCCCAG TAATCAAGTA TCGGGTATAT 
301 TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG 

ACCTCAAGGC GCAATGTATT GAATGCCATT TACCGGGCGG ACCGACTGGC 
351 CCCAACGACC CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT 

GGGTTGCTGG GGGCGGGTAA CTGCAGTTAT TACTGCATAC AAGGGTATCA 
401 AACGCCAATA GGGACTTTCC ATTGACGTCA ATGGGTGGAC TATTTACGGT 

TTGCGGTTAT CCCTGAAAGG TAACTGCAGT TACCCACCTG ATAAATGCCA 
451 AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC 

TTTGACGGGT GAACCGTCAT GTAGTTCACA TAGTATACGG TTCATGCGGG 
501 CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA 

GGATAACTGC AGTTACTGCC ATTTACCGGG CGGACCGTAA TACGGGTCAT 
551 CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 

GTACTGGAAT ACCCTGAAAG GATGAACCGT CATGTAGATG CATAATCAGT 
601 TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA 

AGCGATAATG GTACCACTAC GCCAAAACCG TCATGTAGTT ACCCGCACCT 
651 TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 

ATCGCCAAAC TGAGTGCCCC TAAAGGTTCA GAGGTGGGGT AACTGCAGTT 
701 TGGGAGTTTG TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA 

ACCCTCAAAC AAAACCGTGG TTTTAGTTGC CCTGAAAGGT TTTACAGCAT 
751 ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG 

TGTTGAGGCG GGGTAACTGC GTTTACCCGC CATCCGCACA TGCCACCCTC 
801 GTCTATATAA GCAGAGCTCT CTGGCTAACT AGAGAACCCA CTGCTTACTG 

CAGATATATT CGTCTCGAGA GACCGATTGA TCTCTTGGGT GACGAATGAC 
851 GCTTATCGAA ATTAATACGA CTCACTATAG GGAGACCCAA GCTTGGTACC 

CGAATAGCTT TAATTATGCT GAGTGATATC CCTCTGGGTT CGAACCATGG 
+3 METDTLLLW 
901 GAGCTCGGAT CGATATCGCC ACCATGGAGA CAGACACACT CCTGCTATGG 

CTCGAGCCTA GCTATAGCGG TGGTACCTCT GTCTGTGTGA GGACGATACC 
+3 V L L L W V P G S T G D A P D RS- 
951 GTACTGCTGC TCTGGGTTCC AGGTTCCACT GGTGACGCTC CGGACAGGAG 

CATGACGACG AGACCCAAGG TCCAAGGTGA CCACTGCGAG GCCTGTCCTC 
+3 SD Q E P L Y P V Q V S S A D A RL 
1001 TGACCAGGAG CCGCTGTACC CAGTGCAGGT CAGCTCTGCG GACGCTCGGC 

ACTGGTCCTC GGCGACATGG GTCACGTCCA GTCGAGACGC CTGCGAGCCG 
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+3 L M V F D K T E G T W R L L C S S 
1051 TCATGGTCTT TGACAAGACG GAAGGGACGT GGCGGCTGCT GTGCTCCTCG 
AGTACCAGAA ACTGTTCTGC CTTCCCTGCA CCGCCGACGA CACGAGGAGC 
+3 R S N A R V AGL SCEE MGF L 
1101 CGCTCTAACG CCAGGGTAGC CGGACTCAGC TGCGAGGAGA TGGGCTTCCT 
GCGAGATTGC GGTCCCATCG GCCTGAGTCG ACGCTCCTCT ACCCGAAGGA 
+3 •LR A L T H S E L D V R T A G A N G 

1151 CAGGGCACTG ACCCACTCCG AGCTGGACGT GCGAACGGCG GGCGCCAATG 
GTCCCGTGAC TGGGTGAGGC TCGACCTGCA CGCTTGCCGC CCGCGGTTAC 
+3 G T S G F F C V D E G R L P H T Q 
1201 GCACGTCGGG CTTCTTCTGT GTGGACGAGG GGAGGCTGCC CCACACCCAG 
CGTGCAGCCC GAAGAAGACA CACCTGCTCC CCTCCGACGG GGTGTGGGTC 
+3 R L L E V I S V C D C P R G R F L 
1251 AGGCTGCTGG AGGTCATCTC CGTGTGTGAT TGCCCCAGAG GCCGTTTCTT 
TCCGACGACC TCCAGTAGAG GCACACACTA ACGGGGTCTC CGGCAAAGAA 
+3 LA A I C Q D C G R R K L P V D DP 

1301 GGCCGCCATC TGCCAAGACT GTGGCCGCAG GAAGCTGCCC GTGGACGACG 
CCGGCGGTAG ACGGTTCTGA CACCGGCGTC CTTCGACGGG CACCTGCTGC 
+3 D D K 1 V G G ROT S L G R W P W 

1351 ACGACAAGAT CGTGGGAGGC CGGGACACCA GCTTGGGCCG GTGGCCGTGG 
TGCTGTTCTA GCACCCTCCG GCCCTGTGGT CGAACCCGGC CACCGGCACC 
+3 Q V S L R Y D G A H L C G G S L L • 
1401 CAAGTCAGCC TTCGCTATGA TGGAGCACAC CTCTGTGGGG GATCCCTGCT 
GTTCAGTCGG AAGCGATACT ACCTCGTGTG GAGACACCCC CTAGGGACGA 
+3 - LS G D W V L T A A H C F P E R NR 

1451 CTCCGGGGAC TGGGTGCTGA CAGCCGCCCA CTGCTTCCCG GAGCGGAACC 

GAGGCCCCTG ACCCACGACT GTCGGCGGGT GACGAAGGGC CTCGCCTTGG 

+3 R V L S R W R V F A G A V A Q A S 

1501 GGGTCCTGTC CCGATGGCGA GTGTTTGCCG GTGCCGTGGC CCAGGCCTCT 

CCCAGGACAG GGCTACCGCT CACAAACGGC CACGGCACCG GGTCCGGAGA 
+3 P H G L Q L G V Q A V V Y H G G Y • 

1551 CCCCACGGTC TGCAGCTGGG GGTGCAGGCT GTGGTCTACC ACGGGGGCTA 

GGGGTGC C AG ACGTCGACCC CCACGTCCGA CACCAGATGG TGCCCCCGAT 

+3 • YL P F R P P N S E E N S N D I AL 

1601 TCTTCCCTTT CGGGACCCCA ACAGCGAGGA GAACAGCAAC GATATTGCCC 

AGAAGGGAAA GCCCTGGGGT TGTCGCTCCT CTTGTCGTTG CTATAACGGG 

+3 L V H L SS P LPL T EY I QPV 

1651 TGGTCCACCT CTCCAGTCCC CTGCCCCTCA CAGAATACAT CCAGCCTGTG 

ACCAGGTGGA GAGGTCAGGG GACGGGGAGT GTCTTATGTA GGTCGGACAC 
+3 C L P A AG Q AL V DGK I CT V 

1701 TGCCTCCCAG CTGCCGGCCA GGCCCTGGTG GATGGCAAGA TCTGTACCGT 

ACGGAGGGTC GACGGCCGGT CCGGGACCAC CTACCGTTCT AGACATGGCA 

+3 VTG W GNT Q YY G QQ A G V L Q 

1751 GACGGGCTGG GGCAACACGC AGTACTATGG CCAACAGGCC GGGGTACTCC 

CTGCCCGACC CCGTTGTGCG TCATGATACC GGTTGTCCGG CCCCATGAGG 
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+3 -Q E A R V P I I SN D VC N GA D 
1801 AGGAGGCTCG AGTCCCCATA ATCAGCAATG ATGTCTGCAA TGGCGCTGAT 
TCCTCCGAGC TCAGGGGTAT TAGTCGTTAC TACAGACGTT ACCGCGACTA 
+3 F Y G N Q I K P K M F C A G Y P E 

1851 TTCTATGGAA ACCAGATCAA GCCCAAGATG TTCTGTGCTG GCTACCCCGA 
AAGATACCTT TGGTCTAGTT CGGGTTCTAC AAGACACGAC CGATGGGGCT 
+3 EG G I D A C Q G D S G G P F V C E 
1901 GGGTGGCATT GATGCCTGCC AGGGCGACAG CGGTGGTCCC TTTGTGTGTG 
CCCACCGTAA CTACGGACGG TCCCGCTGTC GCCACCAGGG AAAC AC AC AC 
+3 E D S I S R T PRW R LC G I V S 
1951 AGGACAGCAT CTCTCGGACG CCACGTTGGC GGCTGTGTGG CATTGTGAGT 
TCCTGTCGTA GAGAGCCTGC GGTGCAACCG CCGACACACC GTAACACTCA 
+3 W G T G C A L A Q K P G V Y T K V- 
2001 TGGGGCACTG GCTGTGCCCT GGCCCAGAAG CCAGGCGTCT ACACCAAAGT 
ACCCCGTGAC CGACACGGGA CCGGGTCTTC GGTCCGCAGA TGTGGTTTCA 
+3 - V S D F R E W I F Q A i K T H S E A 
2051 CAGTGACTTC CGGGAGTGGA TCTTCCAGGC CATAAAGACT CACTCCGAAG 
GTCACTGAAG GCCCTCACCT AG AAGGTC CG GTATTTCTGA GTGAGGCTTC 
+3 A S G M V T Q L E F G K P I P N P 

2101 CCAGCGGCAT GGTGACCCAG CTCGAATTCG GTAAGCCTAT CCCTAACCCT 

GGTCGCCGTA CCACTGGGTC GAGCTTAAGC CATTCGGATA GGGATTGGGA 
+3 L L G L D S T R T G H H H H H H 
2151 CTCCTCGGTC TCGATTCTAC GCGTACCGGT CATCATCACC ATCACCATTG 

GAGGAGCCAG AGCTAAGATG CGCATGGCCA GTAGTAGTGG TAGTGGTAAC 
2201 AGTTTAAAGC GGCCGCATAG ATAACTGATC CAGTGTGCTG GAATTAATTC 

TCAAATTTCG CCGGCGTATC TATTGACTAG GTCACACGAC CTTAATTAAG 
2251 GCTGTCTGCG AGGGCCAGCT GTTGGGGTGA GTACTCCCTC TCAAAAGCGG 

CGACAGACGC TCCCGGTCGA CAACCCCACT CATGAGGGAG AGTTTTCGCC 
2301 GCATGACTTC TGCGCTAAGA TTGTCAGTTT CCAAAAACGA GGAGGATTTG 

CGTACTGAAG ACGCGATTCT AACAGTCAAA GGTTTTTGCT CCTCCTAAAC 
2351 ATATTCACCT GGCCCGCGGT GATGCCTTTG AGGGTGGCCG CGTCCATCTG 

TATAAGTGGA CCGGGCGCCA CTACGGAAAC TCCCACCGGC GCAGGTAGAC 
2401 GTCAGAAAAG ACAATCTTTT TGTTGTCAAG CTTGAGGTGT GGCAGGCTTG 

CAGTCTTTTC TGTTAGAAAA ACAACAGTTC GAACTCCACA CCGTCCGAAC 
2451 AG ATCTGGC C ATACACTTGA GTGACAATGA CATCCACTTT GCCTTTCTCT 

TCTAGACCGG TATGTGAACT CACTGTTACT GTAGGTGAAA CGGAAAGAGA 
2501 CCACAGGTGT CCACTCCCAG GTCCAACTGC AGGTCGAGCA TGCATCTAGG 

GGTGTCCACA GGTGAGGGTC CAGGTTGACG TCCAGCTCGT ACGTAGATCC 
2551 GCGGCCAATT CCGCCCCTCT CCCTCCCCCC CCCCTAACGT TACTGGCCGA 

CGCCGGTTAA GGCGGGGAGA GGGAGGGGGG GGGGATTGCA ATGACCGGCT 
2601 AGCCGCTTGG AATAAGGCCG GTGTGCGTTT GTCTATATGT GATTTTCCAC 

TCGGCGAACC TTATTCCGGC CACACGCAAA CAGATATACA CTAAAAGGTG 
2651 CATATTGCCG TCTTTTGGCA ATGTGAGGGC CCGGAAACCT GGCCCTGTCT 

GTATAACGGC AGAAAACCGT TACACTCCCG GGCCTTTGGA CCGGGACAGA 
2701 TCTTGACGAG CATTCCTAGG GGTCTTTCC C CTCTCGCCAA AGGAATGCAA 

AGAACTGCTC GTAAGGATCC CCAGAAAGGG GAGAGCGGTT TCCTTACGTT 
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2751 GGTCTGTTGA ATGTCGTGAA GGAAGCAGTT CCTCTGGAAG CTTCTTGAAG 
C C AG AC AACT TACAGCACTT CCTTCGTCAA GGAGACCTTC GAAGAACTTC 
2801 ACAAACAACG TCTGTAGCGA CCCTTTGCAG GCAGCGGAAC CCCCCACCTG 
TGTTTGTTGC AGACATCGCT GGGAAACGTC CGTCGCCTTG GGGGGTGGAC 
2851 GCGACAGGTG CCTCTGCGGC CAAAAGCCAC GTGTATAAGA TACACCTGCA 
CGCTGTCCAC GGAGACGCCG GTTTTCGGTG CACATATTCT ATGTGGACGT 
2901 AAGGCGGCAC AACCCCAGTG CCACGTTGTG AGTTGGATAG TTGTGGAAAG 
TTCCGCCGTG TTGGGGTCAC GGTGCAACAC TCAACCTATC AACACCTTTC 
2951 AGTCAAATGG CTCTCCTCAA GCGTATTCAA CAAGGGGCTG AAGGATGCCC 
TCAGTTTACC GAGAGGAGTT CGCATAAGTT GTTCCCCGAC TTCCTACGGG 
3 001 AGAAGGTACC CCATTGTATG GGATCTGATC TGGGGCCTCG GTGCACATGC 
TCTTCCATGG GGTAACATAC CCTAGACTAG ACCCCGGAGC CACGTGTACG 
3051 TTT AC ATGTG TTTAGTCGAG GTTAAAAAAA CGTCTAGGCC CCCCGAACCA 
AAATGTACAC AAATCAGCTC CAATTTTTTT GCAGATCCGG GGGGCTTGGT 
3101 CGGGGACGTG GTTTTCCTTT GAAAAACACG ATGATAAGCT TGCCACAACC 
GCCCCTGCAC CAAAAGGAAA CTTTTTGTGC TACTATTCGA ACGGTGTTGG 
3151 CACAAGGAGA CGACCTTCCA TGACCGAGTA CAAGCCCACG GTGCGCCTCG 
GTGTTCCTCT GCTGGAAGGT ACTGGCTCAT GTTCGGGTGC CACGCGGAGC 
3201 CCACCCGCGA CGACGTCCCC CGGGCCGTAC GCACCCTCGC CGCCGCGTTC 
GGTGGGCGCT GCTGCAGGGG GCCCGGCATG CGTGGGAGCG GCGGCGCAAG 
3251 GCCGACTACC CCGCCACGCG CCACACCGTC GACCCGGACC GCCACATCGA 
CGGCTGATGG GGCGGTGCGC GGTGTGGCAG CTGGGCCTGG CGGTGTAGCT 
3301 GCGGGTCACC GAGCTGCAAG AACTCTTCCT CACGCGCGTC GGGCTCGACA 
CGCCCAGTGG CTCGACGTTC TTGAGAAGGA GTGCGCGCAG CCCGAGCTGT 
3351 TCGGCAAGGT GTGGGTCGCG GACGACGGCG CCGCGGTGGC GGTCTGGACC 
AGCCGTTCCA CACCCAGCGC CTGCTGCCGC GGCGCCACCG CCAGACCTGG 
3401 ACGCCGGAGA GCGTCGAAGC GGGGGCGGTG TTCGCCGAGA TCGGCCCGCG 
TGCGGCCTCT CGCAGCTTCG CCCCCGCCAC AAGCGGCTCT AGCCGGGCGC 
3451 CATGGCCGAG TTGAGCGGTT CCCGGCTGGC CGCGCAGCAA CAGATGGAAG 
GTACCGGCTC AACTCGCCAA GGGCCGACCG GCGCGTCGTT GTCTACCTTC 
3501 GCCTCCTGGC GCCGCACCGG CCCAAGGAGC CCGCGTGGTT CCTGGCCACC 
CGGAGGACCG CGGCGTGGCC GGGTTCCTCG GGCGCACCAA GGACCGGTGG 
3551 GTCGGCGTCT CGCCCGACCA CCAGGGCAAG GGTCTGGGCA GCGCCGTCGT 
CAGCCGCAGA GCGGGCTGGT GGTCCCGTTC CCAGACCCGT CGCGGCAGCA 
3601 GCTCCCCGGA GTGGAGGCGG CCGAGCGCGC CGGGGTGCCC GCCTTCCTGG 
CGAGGGGCCT CACCTCCGCC GGCTCGCGCG GCCCCACGGG CGGAAGGACC 
3651 AGACCTCCGC GCCCCGCAAC CTCCCCTTCT ACGAGCGGCT CGGCTTCACC 
TCTGGAGGCG CGGGGCGTTG GAGGGGAAGA TGCTCGCCGA GCCGAAGTGG 
3701 GTCACCGCCG ACGTCGAGTG CCCGAAGGAC CGCGCGACCT GGTGCATGAC 
CAGTGGCGGC TGCAGCTCAC GGGCTTCCTG GCGCGCTGGA CCACGTACTG 
3751 CCGCAAGCCC GGTGCCTGAC GCCCGCCCCA CGACCCGCAG CGCCCGACCG 
GGCGTTCGGG CCACGGACTG CGGGCGGGGT GCTGGGCGTC GCGGGCTGGC 
3801 AAAGGAGCGC ACGACCCCAT GGCTCCGACC GAAGCCGACC CGGGCGGCCC 
"TTTCCTCGCG TGCTGGGGTA CCGAGGCTGG CTTCGGCTGG GCCCGCCGGG 
3851. CGCCGACCCC GCACCCGCCC CCGAGGCCCA CCGACTCTAG ACTCGAGATC 
GCGGCTGGGG CGTGGGCGGG GGCTCCGGGT GGCTGAGATC TGAGCTCTAG 
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3901 

3951 

4001 

4051 

4101 

4151 

4201 

4251 

4301 

4351 

4401 

4451 

4501 

4551 

4601 

4651 

4701 

4751 

4801 

4851 

4901 

4951 

5001 



GATAATCAAC 

CTATTAGTTG 

TAACTATGTT 

ATTGATACAA 

TGTATCATGC 

ACATAGTACG 

AAATCCTGGT 

TTTAGGACCA 

ACGTGGCGTG 

TGCACCGCAC 

GCATTGCCAC 

CGTAACGGTG 

CCTATTGCCA 

GGATAACGGT 

AGGGGCTCGG 

TCCCCGAGCC 

CATCGTCCTT 

GTAGCAGGAA 

GGGACGTCCT 

CCCTGCAGGA 

TTGCCGCGGC 

AAGGGCGCCG 

GCCCTCAGAC 

CGGGAGTCTG 

TCTAGAGCTC 

AGATCTCGAG 

TGTTGTTTGC 

ACAACAAACG 

CCACTGTCCT 

GGTGACAGGA 

AGGTGTCATT 

TCCACAGTAA 

GGATTGGGAA 

CCTAACCCTT 

CTTCTGAGGC 

GAAGACTCCG 

TTTGTCCAAA 

AAACAGGTTT 

CTAGCTAGAG 

GATCGATCTC 

TGTTATCCGC 

ACAATAGGCG 

TAAAGCCTGG 

ATTTCGGACC 

GCTCACTGCC 

CGAGTGACGG 



CTCTGGATTA 
GAGACCTAAT 
GCTCCTTTTA 
CGAGGAAAAT 
TATTGCTTCC 
ATAACGAAGG 
TGCTGTCTCT 
ACGACAGAGA 
GTGTGCACTG 
CACACGTGAC 
CACCTGTCAG 
GTGGACAGTC 
CGGCGGAACT 
GCCGCCTTGA 
CTGTTGGGCA 
GACAACCCGT 

TCCTTGGCTG 

AGGAACCGAC 

TCTGCTACGT 

AGACGATGCA 

CTGCTGCCGG 

GACGACGGCC 

GAGTCGGATC 

CTCAGCCTAG 

GCTGATCAGC 

CGACTAGTCG 

CCCTCCCCCG 

GGGAGGGGGC 

TTC CTAATAA 

AAGGATTATT 

CTATTCTGGG 

GATAAGACCC 

GACAATAGCA 

CTGTTATCGT 

GGAAAGAACC 

CCTTTCTTGG 

CTCATCAATG 

GAGTAGTTAC 

CTTGGCGTAA 

GAACCGCATT 

TCACAATTCC 

AGTGTTAAGG 

GGTGCCTAAT 

CCACGGATTA 

CGCTTTCCAG 

GCGAAAGGTC 



CAAAATTTGT 

GTTTTAAACA 

CGCTATGTGG 

GCGATACACC 

CGTATGGCTT 

GCATACCGAA 

TTATGAGGAG 

AATACTCCTC 

TGTTTGCTGA 

ACAAACGACT 

CTCCTTTCCG 

GAGGAAAGGC 

CATCGCCGCC 

GTAGCGGCGG 

CTGACAATTC 

GACTGTTAAG 

CTCGCCTGTG 

GAGCGGACAC 

CCCTTCGGCC 

GGGAAGCCGG 

CTCTGCGGCC 

GAGACGCCGG 

TCCCTTTGGG 

AGGGAAACCC 

CTCGACTGTG 

GAGCTGACAC 

TGCCTTCCTT 

ACGGAAGGAA 

AATGAGGAAA 

TTACTCCTTT 

GGGTGGGGTG 

CCCACCCCAC 

GGCATGCTGG 

CCGTACGACC 

AGCTGGGGCT 

TCGACCCCGA 

TATCTTATCA 
ATAGAATAGT 
TCATGGTCAT 
AGTACCAGTA 
ACACAACATA 
TGTGTTGTAT 
GAGTGAGCTA 
CTCACTCGAT 
TCGGGAAACC 
AGCCCTTTGG 



GAAAGATTGA 
CTTTCTAACT 
ATACGCTGCT 
TATGCGACGA 
TCATTTTCTC 
AGTAAAAGAG 
TTGTGGCCCG 
AACACCGGGC 
CGCAACCCCC 
GCGTTGGGGG 
GGACTTTCGC 
CCTGAAAGCG 
TGCCTTGCCC 
ACGGAACGGG 
CGTGGTGTTG 
GCACCACAAC 

TTGCCACCTG 

AACGGTGGAC 

CTCAATCCAG 

GAGTTAGGTC 

TCTTCCGCGT 

AGAAGGCGCA 

CCGCCTCCCC 

GGCGGAGGGG 

CCTTCTAGTT 

GGAAGATCAA 

GACCCTGGAA 

CTGGGACCTT 

TTGCATCGCA 

AACGTAGCGT 

GGGCAGGACA 

CCCGTCCTGT 

GGATGCGGTG 

CCTACGCCAC 

CGAGTGCATT 

GCTCACGTAA 

TGTCTGTATA 

ACAGACATAT 

AGCTGTTTCC 

TCGACAAAGG 

CGAGCCGGAA 

GCTCGGC CTT 

ACTCACATTA 

TGAGTGTAAT 

TGTCGTGCCA 

ACAGCACGGT 



CTGGTATTCT 
GACCATAAGA 
TTAATGCCTT 
AATTACGGAA 
CTCCTTGTAT 
GAGGAACATA 
TTGTCAGGCA 
AACAGTCCGT 
ACTGGTTGGG 
TGACCAACCC 
TTTCCCCCTC 
AAAGGGGGAG 
GCTGCTGGAC 
CGACGACCTG 
TCGGGGAAAT 
AGCCCCTTTA 
GATTCTGCGC 
CTAAGACGCG 
CGGACCTTCC 
GCCTGGAAGG 
CTTCGCCTTC 
GAAGCGGAAG 
GCCTGATCGA 
CGGACTAGCT 
GCCAGCCATC 
CGGTCGGTAG 
GGTGC C ACTC 
CCACGGTGAG 
TTGTCTGAGT 
AACAGACTCA 
GCAAGGGGGA 
CGTTCCCCCT 

GGCTCTATGG 
CCGAGATACC 
C T AGTTGTGG 
GATCAACACC 
CCGTCGACCT 
GGCAGCTGGA 
TGTGTGAAAT 
ACACACTTTA 
GCATAAAGTG 
CGTATTTCAC 
ATTGCGTTGC 
TAACGCAACG 
GCTGCATTAA 
CGACGTAATT 
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5051 TGAATCGGCC AACGCGCGGG 

ACTTAGCCGG TTGCGCGCCC 
5101 CGCTTCCTCG CTCACTGACT 

GCGAAGGAGC GAGTGACTGA 
5151 CGGTATCAGC TCACTCAAAG 

GCCATAGTCG AGTGAGTTTC 
5201 GATAACGCAG GAAAGAACAT 

CTATTGCGTC CTTTCTTGTA 
5251 CCGTAAAAAG GCCGCGTTGC 
. GGCATTTTTC CGGCGCAACG 
5301 ACGAGCATCA CAAAAATCGA 

TGCTCGTAGT GTTTTTAGCT 
5351 GG ACTATAAA GATACCAGGC 

CCTGATATTT CTATGGTCCG 
5401 TCCTGTTCCG ACCCTGCCGC 

AGGACAAGGC TGGGACGGCG 
5451 CGGGAAGCGT GGCGCTTTCT 

GCCCTTCGCA CCGCGAAAGA 
5501 GTGTAGGTCG TTCGCTCCAA 

CACATCCAGC AAGCGAGGTT 
5551 GCCCGACCGC TGCGCCTTAT 

CGGGCTGGCG ACGCGGAATA 
5601 TAAGACACGA CTTATCGCCA 

ATTCTGTGCT GAATAGCGGT 
5651 AGAGCGAGGT AT<GTAGGCGG 

TCTCGCTCCA TACATCCGCC 
5701 CTACGGCTAC ACTAGAAGGA 

GATGCCGATG TGATCTTCCT 
5751 CAGTTACCTT CGGAAAAAGA 

GTCAATGGAA GCCTTTTTCT 
5801 ACCGCTGGTA GCGGTGGTTT 

TGGCGACCAT CGCCACCAAA 
5851 AAAAAAAGGA TCTCAAGAAG 

TTTTTTTCCT AGAGTTCTTC 
5901 CTCAGTGGAA CGAAAACTCA 

GAGTCACCTT GCTTTTGAGT 
5951 AAAAGG ATCT TCACCTAGAT 

TTTTCCTAGA AGTGGATCTA 
6001 AATCTAAAGT ATATATGAGT 

TTAGATTTCA TATATACTCA 
6051 TCAGTGAGGC ACCTATCTCA 

AGTCACTCCG TGGATAGAGT 
6101 GCCTGACTCC CCGTCGTGTA 

CGGACTGAGG GGCAGCACAT 
6151 TGGCCCCAGT GCTGCAATGA 

ACCGGGGTCA CGACGTTACT 
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GAGAGGCGGT TTGCGTATTG GGCGCTCTTC 
CTCTCCGCCA AACGCATAAC CCGCGAGAAG 
CGCTGCGCTC GGTCGTTCGG CTGCGGCGAG 
GCGACGCGAG CCAGCAAGCC GACGCCGCTC 
GCGGTAATAC GGTTATCCAC AGAATCAGGG 
CGCCATTATG CCAATAGGTG TCTTAGTCCC 
GTGAGCAAAA GGCCAGCAAA AGGCCAGGAA 
CACTCGTTTT CCGGTCGTTT TCCGGTCCTT 
TGGCGTTTTT CCATAGGCTC CGCCCCCCTG 
ACCGCAAAAA GGTATC CG AG GCGGGGGGAC 
CGCTCAAGTC AGAGGTGGCG AAACCCGACA 
GCGAGTTCAG TCTCCACCGC TTTGGGCTGT 
GTTTCCCCCT GGAAGCTCCC TCGTGCGCTC 
CAAAGGGGGA CCTTCGAGGG AGCACGCGAG 
TTACCGGATA CCTGTCCGCC TTTCTCCCTT 
AATGGCCTAT GG AC AGGCGG AAAGAGGGAA 
CAATGCTCAC GCTGTAGGTA TCTCAGTTCG 
GTTACGAGTG CGACATCCAT AGAGTCAAGC 
GCTGGGCTGT GTGCACGAAC CCCCCGTTCA 
CGACCCGACA CACGTGCTTG GGGGGCAAGT 
CCGGTAACTA TCGTCTTGAG TCCAACCCGG 
GGCCATTGAT AGCAGAACTC AGGTTGGGCC 
CTGGCAGCAG CCACTGGTAA CAGGATTAGC 
GACCGTCGTC GGTGACCATT GTCCTAATCG 
TGCTACAGAG TTCTTGAAGT GGTGGCCTAA 
ACGATGTCTC AAGAACTTCA CCACCGGATT 
CAGTATTTGG TATCTGCGCT CTGCTGAAGC 
GTCATAAACC ATAGACGCGA GACGACTTCG 
GTTGGTAGCT CTTGATCCGG CAAACAAACC 
CAACCATCGA GAACTAGGCC GTTTGTTTGG 
TTTTGTTTGC AAGCAGCAGA TTACGCGCAG 
AAAACAAACG TTCGTCGTCT AATGCGCGTC 
ATC CTTTG AT CTTTTCTACG GGGTCTGACG 
TAGGAAACTA GAAAAGATGC CCCAGACTGC 
CGTTAAGGGA TTTTGGTCAT GAGATTATCA 
GCAATTCCCT AAAACCAGTA CTCTAATAGT 
CCTTTTAAAT TAAAAATGAA GTTTTAAATC 
GGAAAATTTA ATTTTTACTT CAAAATTTAG 
AAACTTGGTC TGACAGTTAC CAATGCTTAA 
TTTGAACCAG ACTGTCAATG GTTACGAATT 
GCGATCTGTC TATTTCGTTC ATCCATAGTT 
CGCTAGACAG ATAAAGCAAG TAGGTATCAA 
GATAACTACG ATACGGGAGG GCTTACCATC 
CTATTGATGC TATGCCCTCC CGAATGGTAG 
TACCGCGAGA CCCACGCTCA CCGGCTCCAG 
ATGGCGCTCT GGGTGCGAGT GGCCGAGGTC 
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FIG. 9-7 



6201 ATTTATCAGC AATAAACCAG 

TAAATAGTCG TTATTTGGTC 
6251 CCTGCAACTT TATCCGCCTC 

GGACGTTGAA ATAGGCGGAG 
6301 TAGAGTAAGT AGTTCGCCAG 

ATCTCATTCA TCAAGCGGTC 
6351 CTACAGGCAT CGTGGTGTCA 

GATGTCCGTA GCACCACAGT 
6401 TCCGGTTCCC AACGATCAAG 

AGGCCAAGGG TTGCTAGTTC 
64 51 AAAAGCGGTT AGCTCCTTCG 

TTTTCGCCAA TCGAGGAAGC 
6501 CCGCAGTGTT ATCACTCATG 

GGCGTCACAA TAGTGAGTAC 
6551 GTCATGCCAT CCGTAAGATG 

CAGTACGGTA GGCATTCTAC 
6601 GTCATTCTGA GAATAGTGTA 

CAGTAAGACT CTTATCACAT 
6651 CAATACGGGA TAATACCGCG 

GTTATGCCCT ATTATGGCGC 
6701 ATTGGAAAAC GTTCTTCGGG 

TAACCTTTTG CAAGAAGCCC 
6751 GAGATCCAGT TCGATGTAAC 

CTCTAGGTCA AGCTACATTG 
6801 CTTTTACTTT CACCAGCGTT 

GAAAATGAAA GTGGTCGCAA 
6851 GCCGCAAAAA AGGGAATAAG 

CGGCGTTTTT TCCCTTATTC 
6901 CTTCCTTTTT CAATATTATT 

GAAGGAAAAA GTTATAATAA 
6951 GCGGATACAT ATTTGAATGT 

CGCCTATGTA TAAACTTACA 
7001 CGCACATTTC CCCGAAAAGT 
GCGTGTAAAG GGGCTTTTCA 
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CCAGCCGGAA GGGCCGAGCG CAGAAGTGGT 
GGTCGGCCTT CCCGGCTCGC GTCTTCACCA 
CATCCAGTCT ATTAATTGTT GCCGGGAAGC 
GTAGGTCAGA TAATTAACAA CGGCCCTTCG 
TTAATAGTTT GCGCAACGTT GTTGCCATTG 
AATTATCAAA CGCGTTGCAA CAACQGTAAC 
CGCTCGTCGT TTGGTATGGC TTCATTCAGC 
GCGAGCAGCA AACCATACCG AAGTAAGTCG 
GCGAGTTACA TGATCCCCCA TGTTGTGCAA 
CGCTCAATGT ACTAGGGGGT AC AAC AC GTT 
GTCCTCCGAT CGTTGTCAGA AGTAAGTTGG 
CAGGAGGCTA GCAACAGTCT TCATTCAACC 
GTTATGGCAG CACTGCATAA TTCTCTTACT 
CAATACCGTC GTGACGTATT AAGAGAATGA 
CTTTTCTGTG ACTGGTGAGT ACTCAACCAA 
GAAAAGACAC TGACCACTCA TGAGTTGGTT 
TGCGGCGACC GAGTTGCTCT TGCC CGGCGT 
ACGCCGCTGG CTCAACGAGA ACGGGCCGCA 
CCACATAGCA GAACTTTAAA AGTGCTCATC 
GGTGTATCGT CTTGAAATTT TCACGAGTAG 
GCGAAAACTC TCAAGGATCT TACCGCTGTT 
CGCTTTTGAG AGTTCCTAGA ATGGCGACAA 
CCACTCGTGC ACCCAACTGA TCTTCAGCAT 
GGTGAGCACG TGGGTTGACT AGAAGTCGTA 
TC TGGGTG AG CAAAAACAGG AAGGCAAAAT 
AGACCCACTC GTTTTTGTCC TTCCGTTTTA 
GGCGACACGG AAATGTTGAA TACTCATACT 
CCGCTGTGCC TTTACAACTT ATGAGTATGA 
GAAGCATTTA TCAGGGTTAT TGTCTCATGA 
CTTCGTAAAT AGTCCCAATA ACAGAGTACT 
ATTTAGAAAA ATAAACAAAT AGGGGTTCCG 
TAAATCTTTT TATTTGTTTA TCCCCAAGGC 
GCCACCTGAC GTC 
CGGTGGACTG CAG 
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FIG. 10-1 

pCEP4W/hepEK 

1 TCGAGCGGCC GCTTTAAACT CAATGGTGAT GGTGATGATG ACCGGTACGC 
AGCTCGCCGG CGAAATTTGA G TT AC C ACTA CCACTACTAC TGGCCATGCG 
-3 HHHHHHGT RT 

51 GTAGAATCGA GACCGAGGAG AGGGTTAGGG ATAGGCTTAC CGAATTCGAG 
CATCTTAGCT CTGGCTCCTC TCCCAATCCC TATCCGAATG GCTTAAGCTC 
-3 -TSD L GL L PNP I PKG FEL 
101 CTGGGTCACC ATGCCGCTGG CTTCGGAGTG AGTCTTTATG GCCTGGAAGA 
GACCCAGTGG TACGGCGACC GAAGCCTCAC TCAGAAATAC CGGACCTTCT 
-3 QTV MGS A ES H TKI A Q F I • 
151 TCCACTCCCG GAAGTCACTG ACTTTGGTGT AGACGCCTGG CTTCTGGGCC 
AGGTGAGGGC CTTCAGTGAC TGAAACCACA TCTGC GGAC C GAAGACCCGG 
-3 I W E R FDS V KT Y VG P KQAL 
201 AGGGCACAGC CAGTGCCCCA ACTCACAATG CCACACAGCC GCCAACGTGG 
TCCCGTGTCG GTCACGGGGT TGAGTGTTAC GGTGTGTCGG CGGTTGCACC 
-3 LAC G TGW SVI G CL RWRP 
251 CGTCCGAGAG ATGCTGTCCT CACACACAAA GGGACCACCG CTGTCGCCCT 
GCAGGCTCTC TACGACAGGA GTGTGTGTTT CCCTGGTGGC GACAGCGGGA 
-3 TRS I SD ECV F PGG SDGQ- 
301 GGCAGGCATC AATGCCACCC TCGGGGTAGC CAGCACAGAA CATCTTGGGC 
CCGTCCGTAG TTACGGTGGG AGCCCCATCG GTCGTGTCTT GTAGAACCCG 
-3 *Q C A D IGG E PY G AC F MKPK 
351 TTGATCTGGT TTCCATAGAA ATCAGCGCCA TTGCAGACAT CATTGCTGAT 
AACTAGACCA AAGGTATCTT TAGTCGCGGT AACGTCTGTA GTAAC GACTA 
-3 -KIQNGY FDAGNCVDNSI 
401 TATGGGGACT CGAGCCTCCT GGAGTACCCC GGCCTGTTGG CCATAGTACT 
. ATACCCCTGA GCTCGGAGGA CCTCATGGGG CCGGACAACC GGTATCATGA 
-3 IPVRAEQLVGAQQGYYQ- 
451 GCGTGTTGCC CCAGCCCGTC ACGGTACAGA TCTTGCCATC CACCAGGGCC 
CGCACAACGG GGTC GGGC AG TGCCATGTCT AGAACGGTAG GTGGTCCCGG 
•3 Q T N GWGT V TC I KG D VLAQ 
501 TGGCCGGCAG CTGGGAGGCA CACAGGCTGG ATGTATTCTG TGAGGGGCAG 
ACCGGCCGTC GACCCTCCGT GTGTCCGACC TACATAAGAC ACTCCCCGTC 
-3 -QG A A PL C VPQ I YE T LPL 
551 GGGACTGGAG AGGTGGACCA GGGCAATATC GTTGCTGTTC TCCTCGCTGT 
CCCTGACCTC TCCACCTGGT CCCGTTATAG CAACGACAAG AGGAGCGACA 
-3 P S S L HV L A I D NSN E ESN- 
601 TGGGGTCCCG AAAGGGAAGA TAGCCCCCGT GGTAGACCAC AGCCTGCACC 
ACCCCAGGGC TTTCCCTTCT ATCGGGGGCA CCATCTGGTG TCGGACGTGG 
-3 NPD R FPL Y GG H YV V AQVG 
651 CCCAGCTGCA GACCGTGGGG AGAGGCCTGG GCCACGGCAC CGGCAAACAC 
GGGTCGACGT CTGGCACCCC TCTCCGGACC CGGTGC CGTG GCCG TTTGTG 
-3 GLQ L GH P SA Q A VA G A F V 
701 TCGCCATCGG GACAGGACCC GGTTCCGCTC CGGGAAGCAG TGGGCGGCTG 
AGCGGTAGCC CTGTCCTGGG CCAAGGCGAG GCCCTTCGTC ACCCGCCGAC 
-3 RWRSLVRNREPFCHAAT- 
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FIG. 10-2 

pCEP4W/hepEK 

7 51 TCAGCACCCA GTCCCCGGAG AGCAGGGATC CCCCACAGAG GTGTGCTCCA 
AGTCGTGGGT CAGGGGCCTC TCGTCCCTAG GGGGTGTCTC CACACGAGGT 
-3 TLVW DGS L LS G GC L HAGD 
801 TCATAGCGAA GGCTGACTTG CCACGGCCAC CGGCCCAAGC TGGTGTCCCG 
AGTATCGCTT CCGACTGAAC GGTGCCGGTG GCCGGGTTCG ACCACAGGGC 
-3 0YR L SVQWPWR GL S TOR 
851 GCCTCCCACG ATCTTGTCGT CGTCGTCCAC GGGCAGCTTC CTGCGGCCAC 
CGGAGGGTGC TAGAACAGCA GCAGCAGGTG CCCGTCGAAG GACGCCGGTG 
-3 G GV I KODDDVPLKRRGC- 
901 AGTCTTGGCA GATGGCGGCC AAGAAACGGC CTCTGGGGCA ATCACACACG 
TCAGAACCGT CTACCGCCGG TTCTTTGCCG GAGACCCCGT TAGTGTGTGC 
-3 *C D Q C I A A L FR G RP C DCVS 
9 51 GAGATGACCT CCAGCAGCCT CTGGGTGTGG GGCAGCCTCC CCTCGTCCAC 
CTCTACTGGA GGTCGTCGGA GACCCACACC CCGTCGGAGG GGAGCAGGTG 
-3 SI V E LL R QTH P LR G EDV 
1001 ACAGAAGAAG CCCGACGTGC CATTGGCGCC CGCCGTTCGC ACGTCCAGCT 
TGTCTTCTTC GGGCTGCACG GTAACCGCGG GCGGCAAGCG TGCAGGTCGA 
-3 CFF G ST G NAG ATR V OLE* 
1051 CGGAGTGGGT CAGTGCCCTG AGGAAGCCCA TCTCCTCGCA GCTGAGTCCG 
GCCTCACCCA GTCACGGGAC TCCTTCGGGT AGAGGAGCGT CGACTCAGGC 
-3 ESH T LAR L FG M EE C SLGA 
1101 GCTACCCTGG CGTTAGAGCG CGAGGAGCAC AGCAGCCGCC ACGTCCCTTC 
CGATGGGACC GCAATCTCGC GCTCCTCGTG TCGTCGGCGG TGCAGGGAAG 
-3 AVR ANSR SSC L LRWTGE 
1151 CGTCTTGTCA AAGACCATGA GCCGAGCGTC CGCAGAGCTG ACCTGCACTG 
GCAGAACAGT TTCTGGTACT CGGCTCGCAG GCGTCTCGAC TGGACGTGAC 
-3 TKDFVMLRADASSVQVP- 
1201 GGTACAGCGG CTCCTGGTCA CTCCTATCCG GAGCGTCACC AGTGGAACCT 
CCATGTCGCC GAGGACCAGT GAGGATAGGC CTCGCAGTGG TC AC CTTGG A 
-3 PYL P EQD S RD P AD G TSGP 
1251 GGAACCCAGA GCAGCAGTAC CCATAGCAGG AGTGTGTCTG TCTCCATGGT 
CCTTGGGTCT CGTCGTCATG GGTATCGTCC TCACACAGAC AGAGGTA CCA 
-3 -PVW L LL VWLL L TD T EM 
1301 GGCGATCTGG TACCCAGCTT CTAGAGATCT GACGGTTCAC TAAACGAGCT 
CCGCTAGACC ATGGGTCGAA GATCTCTAGA CTGCCAAGTG ATTTGCTCGA 
1351 C TGCTT AT AT AGACCTCCCA CCGTACACGC CTACCGCCCA TTTGCGTCAA 
GACGAATATA TCTGGAGGGT GGCATGTGCG GATGGCGGGT AAACGCAGTT 
1401 CGGGGCGGGG TTATTACGAC ATTTTGGAAA GTCCCGTTGA TTTTGGTGCC 
GCCCCGCCCC AATAATGCTG TAAAAC CTTT CAGGGCAACT AAAACCACGG 
1451 AAAACAAACT CCCATTGACG TCAATGGGGT GGAGACTTGG AAATCCCCGT 
TTTTGTTTGA GGGTAACTGC AGTTACCCCA CCTCTGAACC TTTAGGGGCA 
1501 GAGTCAAACC GCTATCCACG CCCATTGGTG TACTGCCAAA ACCGCATCAC 
CTCAGTTTGG CGATAGGTGC GGGTAACCAC ATGACGGTTT TGGCGTAGTG 
1551 CATGGTAATA GCGATGACTA ATACGTAGAT GTACTGCCAA GTAGGAAAGT 
GTACCATTAT CGCTACTGAT TATGCATCTA CATGACGGTT CATCCTTTCA 
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FIG. 10-3 



1601 CCCGTAAGGT CATGTACTGG 

GGGCATTCCA GTACATGACC 
1651 TTGACGTCAA TAGGGGGCGG 

AACTGCAGTT ATCCCCCGCC 
1701 CAAGTGGGCA GTTTACCGTA 

GTTCACCCGT CAAATGGCAT 
1751 TCCCTATTGG CGTTACTATG 

AGGGATAACC GCAATGATAC 
1801 CGGGGGTCGT TGGGCGGTCA 

GCCCCCAGCA ACCCGCCAGT 
1851 AACGCGGAAC TCCATATATG 

TTGCGCCTTG AGGT AT AT AC 
1901 ACTATTAATA ACTAGTCAAT 

TGATAATTAT TGATCAGTTA 
1951 CATGAGCC AA TATAAATGTA 

GTACTCGGTT ATATTTACAT 
2001 TGGCCAATAG CCAATATTGA 

ACCGGTTATC GGTTATAACT 
2051 CTAATTGCCA ATATTGATTC 

GATTAACGGT TATAACTAAG 
2101 TGCGCCTGCA GCAATGCAAC 

ACGCGGACGT CGTTACGTTG 
2151 TCAAACATGA GAATTGGTCG 

AGTTTGTACT CTTAACCAGC 
2201 TGGTTTTTGG GGGTCGGGGG 

ACCAAAAACC CCCAGCCCCC 
2251 TGTCGCGCCA GTACATGCGG 

ACAGCGCGGT CATGTACGCC 
2 301 TCTGTCTGCT CAGTCCAGTC 

AG AC AG AC G A GTCAGGTCAG 
2351 AATAACCCCC ACGAACCATA 

TTATTGGGGG TGCTTGGTAT 
2401 ACCCACGGGG CCAGTGGCTA 

TGGGTGCCCC GGTCACCGAT 
2451 CGAGCCCTGG GCCTTCACCC 

GCTCGGGACC CGGAAGTGGG 
2501 GAAACGCGGG CGTATTGGCC 

CTTTGCGCCC GCATAACCGG 
2551 GTGCCAGCCC TGGGACCGAA 

CACGGTCGGG ACCCTGGCTT 
2601 CCGTGCGTTT TATTCTGTCT 

GGCACGCAAA ATAAGACAGA 
2651 CGGTATTGTC TCCTTCCGTG 

GCCATAACAG AGGAAGGCAC 
2701 CCTTTGCCCT CGGACGAGTG 

GGAAACGGGA GCCTGCTCAC 



pCEP4W/hepEK 

GCATAATGCC AGGCGGGCCA TTTACCGTCA 
CGTATTACGG TCCGCCCGGT AAATGGCAGT 
ACTTGGCATA TGATACACTT GATGTACTGC 
TGAACCGTAT ACTATGTGAA CTACATGACG 
AATACTCCAC CCATTGACGT CAATGGAAAG 
TTATGAGGTG GGTAACTGCA GTTACCTTTC 
GGAACATACG TCATTATTGA CGTCAATGGG 
CCTTGTATGC AGTAATAACT GCAGTTACCC 
GCCAGGCGGG CCATTTACCG TAAGTTATGT 
CGGTCCGCCC GGTAAATGGC ATTCAATACA 
GGCTATGAAC TAATGACCCC GTAATTGATT 
CCGATACTTG ATTACTGGGG CATTAACTAA 
AATCAATGTC AACATGGCGG TCATATTGGA 
TTAGTTACAG TTGTACCGCC AGTATAACCT 
CATATTATGA TATAGATACA ACGTATGCAA 
GTATAATACT ATATCTATGT TGCATACGTT 
TTTATGCTAT ATAACCAATG ACTAATATGG 
AAATACGATA TATTGGTTAC TGATTATACC 
AATGTATAGA TCTTCCATAC CTACCAGTTC 
TTACATATCT AGAAGGTATG GATGGTCAAG 
AACGTTGCCC GGATCTGCGA TGATAAGCTG 
TTGCAACGGG CCTAGACGCT ACTATTCGAC 
ACTAGCTTGG CACGCCAGAA ATCCGCGCGG 
TGATCGAACC GTGCGGTCTT TAGGCGCGCC 
TGTTTGGCAG CCACAGACGC CCGGTGTTCG 
ACAAACCGTC GGTGTCTGCG GGCCACAAGC 
TCCATGCCCA GGCCATCCAA AAACCATGGG 
AGGTACGGGT CCGGTAGGTT TTTGGTACCC 
GTGGACCAGA CCCCACGCAA CGCCCAAAAT 
CACCTGGTCT GGGGTGCGTT GCGGGTTTTA 
AACCATTCCC CATGGGGGAC CCCGTCCCTA 
TTGGTAAGGG GTACCCCCTG GGGCAGGGAT 
TGGCAGGGCC TGCCGCCCCG ACGTTGGCTG 
ACCGTCCCGG ACGGCGGGGC TGCAACCGAC 
GAACTTGGGG GGTGGGGTGG GGAAAAGGAA 
CTTGAACCCC CCACCCCACC CCTTTTCCTT 
CCAATGGGGT CTCGGTGGGG TATCGACAGA 
GGTTACCCCA GAGCCACCCC ATAGCTGTCT 
CCCCGCGTTT ATGAACAAAC GACCCAACAC 
GGGGCGCAAA TACTTGTTTG CTGGGTTGTG 
TTTTATTGCC GTCATAGCGC GGGTTCCTTC 
AAAATAACGG CAGTATCGCG CCCAAGGAAG 
TTTCAGTTAG CCTCCCCCAT CTCCCCTATT 
AAAGTCAATC GGAGGGGGTA GAGGGGATAA 
CTGGGGCGTC GGTTTCCACT ATCGGCGAGT 
GACCCCGCAG C C AAAGGTG A TAGCCGCTCA 
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FIG. 10-4 
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2751 ACTTCTACAC AGCCATCGGT CCAGACGGCC GCGCTTCTGC GGGCGATTTG 
TGAAGATGTG TCGGTAGCCA GGTCTGCCGG CGCGAAGACG CCCGCTAAAC 
2801 TGTACGCCCG ACAGTCCCGG CTCCGGATCG GACGATTGCG TCGCATCGAC 
ACATGCGGGC TGTCAGGGCC GAGGCCTAGC CTGCTAACGC AGCGTAGCTG 
2851 CCTGCGCCCA AGCTGCATCA TCGAAATTGC CGTCAACCAA GCTCTGATAG 
GGACGCGGGT TCGACGTAGT AGCTTTAACG GCAGTTGGTT CGAGACTATC 
2901 AGTTGGTCAA GACCAATGCG GAGCATATAC GCCCGGAGCC GCGGCGATCC 
TCAACCAGTT CTGGTTACGC CTCGTATATG CGGGCCTCGG CGCCGCTAGG 
2951 TGCAAGCTCC GGATGCCTCC GCTCGAAGTA GCGCGTCTGC TGCTCCATAC 
ACGTTCGAGG CCTACGGAGG CGAGCTTCAT CGCGCAGACG ACGAGGTATG 
3001 AAGCCAACCA CGGCCTCCAG AAGAAGATGT TGGCGACCTC GTATTGGGAA 
TTCGGTTGGT GCCGGAGGTC TTCTTCTACA ACCGCTGGAG CATAACCCTT 
3051 TCCCCGAACA TCGCCTCGCT CCAGTCAATG ACCGCTGTTA TGCGGCCATT 
AGGGGCTTGT AGCGGAGCGA GGTCAGTTAC TGGCGACAAT ACGCCGGTAA 
3101 GTCCGTCAGG ACATTGTTGG AGCCGAAATC CGCGTGCACG AGGTGCCGGA 
CAGGCAGTCC TGTAACAACC TCGGCTTTAG GCGCACGTGC TCCACGGCCT 
3151 CTTCGGGGCA GTCCTCGGCC CAAAGCATCA GCTCATCGAG AGCCTGCGCG 
GAAGCCCCGT CAGGAGCCGG GTTTCGTAGT CGAGTAGCTC TCGGACGCGC 
3201 ACGGACGCAC TGACGGTGTC GTCCATCACA GTTTGCCAGT GATACACATG 
TGCCTGCGTG ACTGCCACAG CAGGTAGTGT CAAACGGTCA C T ATGTGT AC 
3251 GGGATCAGCA ATCGCGC ATA TGAAATCACG CCATGTAGTG TATTGACCGA 
CCCTAGTCGT TAGCGCGTAT ACTTTAGTGC GGTACATCAC ATAACTGGCT 
3301 TTCCTTGCGG TCCGAATGGG CCGAACCCGC TCGTCTGGCT AAGATCGGCC 
AAGGAACGCC AGGCTTACCC GGCTTGGGCG AGCAGACCGA TTCTAGCCGG 
3351 GCAGCGATCG CATCCATGGC CTCCGCGACC GGCTGCAGAA CAGCGGGCAG 
CGTCGCTAGC GTAGGTACCG GAGGCGCTGG CCGACGTCTT GTCGCCCGTC 
3401 TTCGGTTTCA GGCAGGTCTT GCAACGTGAC ACCCTGTGCA CGGCGGGAGA 
AAGCCAAAGT CCGTCCAGAA CGTTGCACTG TGGGACACGT GCCGCCCTCT 
3451 TGCAATAGGT CAGGCTCTCG CTGAATTCCC CAATGTCAAG CACTTCCGGA 
ACGTTATC C A GTCCGAGAGC GACTTAAGGG GTT AC AGTTC GTGAAGGCCT 
3501 ATCGGGAGCG CGGCCGATGC AAAGTGCCGA TAAACATAAC GATCTTTGTA 
TAGCCCTCGC GCCGGCTACG TTTCACGGCT ATTTGTATTG CTAGAAACAT 
3551 GAAACCATCG GCGCAGCTAT TTACCCGCAG G AC ATATC C A CGCCCTCCTA 
CTTTGGTAGC CGCGTCGATA AATGGGCGTC CTGTATAGGT GCGGGAGGAT 
3 601 CATCGAAGCT GAAAGCACGA GATTCTTCGC CCTCCGAGAG CTGCATCAGG 
GTAGCTTCGA CTTTCGTGCT CTAAGAAGCG GGAGGCTCTC GACGTAGTCC 
3651 TCGGAGACGC TGTCGAACTT TTCGATCAGA AACTTCTCGA CAGACGTCGC 
AGCCTCTGCG ACAGCTTGAA AAGCTAGTCT TTGAAGAGCT GTCTGCAGCG 
3701 GGTGAGTTCA GGCTTTTTCA TATCTCATTG CCCGGGATCT GCGGCACGCT 
CCACTCAAGT CCGAAAAAGT ATAGAGTAAC GGGCCCTAGA CGCCGTGCGA 
37 51 GTTGACGCTG TTAAGCGGGT CGCTGCAGGG TCGCTCGGTG TTCGAGGCCA 
CAACTGCGAC AATTCGCCCA GCGACGTCCC AGCGAGCCAC AAGCTCCGGT 
3801 CACGCGTCAC CTTAATATGC GAAGTGGACC TGGGACCGCG CCGCCCCGAC 
GTGCGCAGTG GAATTATACG CTTCACCTGG ACCCTGGCGC GGCGGGGCTG 
3851 TGCATCTGCG TGTTCGAATT CGCCAATGAC AAGACGCTGG G CGGGGTTTG 
ACGTAGACGC ACAAGCTTAA GCGGTTACTG TTCTGCGACC CGCCCCAAAC 
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FIG. 10-5 
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3901 TGTCATCATA GAACTAAAGA CATGCAAATA TATTTCTTCC GGGGACACCG 
ACAGTAGTAT CTTGATTTCT GTACGTTTAT ATAAAGAAGG CCCCTGTGGC 
39 51 CCAGCAAACG CGAGCAACGG GCCACGGGGA TGAAGCAGGG CATGGCGGCC 
GGTCGTTTGC GCTCGTTGCC CGGTGCCCCT ACTTCGTCCC GTACCGCCGG 
4001 GACGCGCTGG GCTACGTCTT GCTGGCGTTC GCGACGCGAG GCTGGATGGC 
CTGCGCGACC CGATGCAGAA CGACCGCAAG CGCTGCGCTC CGACCTACCG 
4051 CTTCCCCATT ATGATTCTTC TCGCTTCCGG CGGCATCGGG ATGCCCGCGT 
GAAGGGGTAA TACTAAGAAG AGCGAAGGCC GCCGTAGCCC TACGGGCGCA 
4101 TGCAGGCCAT GCTGTCCAGG CAGGTAGATG ACGACCATCA GGGACAGCTT 
ACGTCCGGTA CGACAGGTCC GTCCATCTAC TGCTGGTAGT CCCTGTCGAA 
4151 CAAGGATCGC TCGCGGCTCT TACCAGCCTA ACTTCGATCA CTGGACCGCT 
GTTCCTAGCG AGCGCCGAGA ATGGTCGGAT TGAAGCTAGT GACCTGGCGA 
4201 GATCGTCACG GCGATTTATG CCGCCTCGGC GAGCACATGG AACGGGTTGG 
CTAGCAGTGC CGCTAAATAC GGCGGAGCCG CTCGTGTACC TTGCCCAACC 
4251 CATGGATTGT AGGCGCCGCC CTATACCTTG TCTGCCTCCC CGCGTTGCGT 
GTACCTAACA TCCGCGGCGG GATATGGAAC AGACGGAGGG GCGCAACGCA 
4301 CGCGGTGCAT GGAGCCGGGC CACCTCGACC TGAATGGAAG CCGGCGGCAC 
GCGCCACGTA CCTCGGCCCG GTGGAGCTGG ACTTACCTTC GGCCGCCGTG 
4351 CTCGCTAACG GATTCACCAC TCCAAGAATT GGAGCCAATC AATTCTTGCG 
GAGCGATTGC CTAAGTGGTG AGGTTCTTAA CCTCGGTTAG TTAAGAACGC 
4401 GAGAACTGTG AATGCGCAAA CCAACCCTTG GCAGAACATA TCCATCGCGT 
CTCTTGACAC TTACGCGTTT GGTTGGGAAC CGTCTTGTAT AGGTAGCGCA 
4451 CCGCCATCTC CAGCAGCCGC ACGCGGCGCA GCAAAAGGCC AGGAAC CGTA 
GGCGGTAGAG GTCGTCGGCG TGCGCCGCGT CGTTTTCCGG TCCTTGGCAT 
4501 AAAAGGCCGC GTTGCTGGCG TTTTTCCATA GGCTCCGCCC CCCTGACGAG 
TTTTC CGGCG CAACGACCGC AAAAAGGTAT CCGAGGCGGG GGGACTGCTC 
4551 CATCACAAAA ATCGACGCTC AAGTCAGAGG TGGCGAAACC CGACAGGACT 
GTAGTGTTTT TAGCTGCGAG TTCAGTCTCC ACCGCTTTGG GCTGTCCTGA 
4601 ATAAAGATAC CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG CGCTCTCCTG 
TATTTCTATG GTCCGCAAAG GGGGACCTTC GAGGGAGCAC GCGAGAGGAC 
4651 TTCCGACCCT GCCGCTTACC GGATACCTGT CCGCCTTTCT CCCTTCGGGA 
AAGGCTGGGA CGGCGAATGG CCTATGGACA GGCGGAAAGA GGGAAGCCCT 
4701 AGCGTGGCGC TTTCTCATAG CTCACGCTGT AGGTATCTCA GTTCGGTGTA 
TCGCACCGCG AAAGAGTATC GAGTGCGACA TCCATAGAGT CAAGCCACAT 
4751 GGTCGTTCGC TCCAAGCTGG GCTGTGTGCA CGAACCCCCC GTTCAGCCCG 
CCAGCAAGCG AGGTTCGACC CGACACACGT GCTTGGGGGG CAAGTCGGGC 
4801 ACCGCTGCGC CTTATCCGGT AACTATCGTC TTGAGTCCAA CCCGGTAAGA 
TGGCGACGCG GAATAGGCCA TTGATAGCAG AACTCAGGTT GGGCCATTCT 
4851 CACGACTTAT CGCCACTGGC AGCAGCCACT GGTAACAGGA TTAGCAGAGC 
GTGCTGAATA GCGGTGACCG TCGTCGGTGA CCATTGTCCT AATCGTCTCG 
4901 GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG CCTAACTACG 
CTCCATACAT CCGCCACGAT GTCTCAAGAA CTTCACCACC GGATTGATGC 
4951 GCTACACTAG AAGGACAGTA TTTGGTATCT GCGCTCTGCT GAAGCCAGTT 
CGATGTGATC TTCCTGTCAT AAACCATAGA CGCGAGACGA CTTCGGTCAA 
5001 ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA TCCGGCAAAC AAACCACCGC 
TGGAAGCCTT TTTCTCAACC ATCGAGAACT AGGCCGTTTG TTTGGTGGCG 
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5051 TGGTAGCGGT GGTTTTTTTG TTTGCAAGCA GCAGATTACG CGCAGAAAAA 
ACCATCGCCA CCAAAAAAAC AAACGTTCGT CGTCTAATGC GCGTCTTTTT 
5101 AAGGATCTCA AGAAGATCCT TTGATCTTTT CTACGGGGTC TGACGCTCAG 
TTCCTAGAGT TCTTCTAGGA AACTAGAAAA GATGCCCCAG ACTGCGAGTC 
5151 TGGAACGAAA ACTCACGTTA AGGGATTTTG GTCATGAGAT TATCAAAAAG 
ACCTTGCTTT TGAGTGCAAT TCCCTAAAAC CAGTACTCTA ATAGTTTTTC 
5201 GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT AAATCAATCT 
CTAGAAGTGG ATCTAGGAAA ATTTAATTTT TACTTCAAAA TTTAGTTAGA 
5251 AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT 
TTTCATATAT ACTCATTTGA ACCAGACTGT CAATGGTTAC GAATTAGTCA 
5301 GAGGCACCTA TCTCAGCGAT C TGTCT ATTT CGTTCATCCA TAGTTGCCTG 
CTCCGTGGAT AGAGTCGCTA GACAGATAAA GCAAGTAGGT ATCAACGGAC 
5351 ACTCCCCGTC GTGTAGATAA CTACGATACG GGAGGGCTTA CCATCTGGCC 
TGAGGGGCAG CACATCTATT GATGCTATGC CCTCCCGAAT GGTAGACCGG 
5401 CCAGTGCTGC AATGATACCG CGAGACCCAC GCTCACCGGC TCC AG ATTT A 
GGTCACGACG TTACTATGGC GCTCTGGGTG CGAGTGGCCG AGGTCTAAAT 
5451 TCAGCAATAA ACCAGCCAGC CGGAAGGGCC GAGCGCAGAA GTGGTCCTGC 
AGTCGTTATT TGGTCGGTCG GCCTTCCCGG CTCGCGTCTT CACCAGGACG 
5501 AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG GAAGCTAGAG 
TTGAAATAGG CGGAGGTAGG TCAGATAATT AACAACGGCC CTTCGATCTC 
5551 TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTGCA 
ATTCATCAAG CGGTCAATTA TCAAACGCGT TGCAACAACG GTAACGACGT 
5601 GGCATCGTGG TGTCACGCTC GTCGTTTGGT ATGGCTTCAT TCAGCTCCGG 
CCGTAGCACC ACAGTGCGAG CAGCAAACCA TACCGAAGTA AGTCGAGGCC 
5651 TTCCCAACGA TCAAGGCGAG TTACATGATC CCCCATGTTG TGCAAAAAAG 
AAGGGTTGCT AGTTCCGCTC AATGTACTAG GGGGTACAAC ACGTTTTTTC 
5701 CGGTTAGCTC CTTCGGTCCT CCGATCGTTG TCAGAAGTAA GTTGGCCGCA 
GCCAATCGAG GAAGCCAGGA GGCTAGCAAC AGTCTTCATT CAACCGGCGT 
5751 GTGTTATCAC TCATGGTTAT GGCAGCACTG CATAATTCTC TTACTGTCAT 
CACAATAGTG AGTACCAATA CCGTCGTGAC GTATTAAGAG AATGACAGTA 
5801 GCCATCCGTA AGATGCTTTT CTGTGACTGG TGAGTACTCA ACCAAGTCAT 
CGGTAGGCAT TCTACGAAAA GACACTGACC ACTCATGAGT TGGTTCAGTA 
5851 TCTGAGAATA GTGTATGCGG CGACCGAGTT GCTCTTGCCC GGCGTCAACA 
AGACTCTTAT CACATACGCC GCTGGCTCAA CGAGAACGGG CCGCAGTTGT 
5901 CGGGATAATA CCGCGCCACA TAGCAGAACT TTAAAAGTGC TCATCATTGG 
GCCCTATTAT GGCGCGGTGT ATCGTCTTGA AATTTTCACG AGTAGTAACC 
5951 AAAACGTTCT TCGGGGCGAA AACTCTCAAG GATCTTACCG CTGTTGAGAT 
TTTTGC AAG A AGCCCCGCTT TTGAGAGTTC CTAGAATGGC GACAACTCTA 
6001 CCAGTTCGAT GTAACCCACT CGTGCACCCA ACTGATCTTC AGCATCTTTT 
GGTCAAGCTA CATTGGGTGA GCACGTGGGT TGACTAGAAG TCGTAGAAAA 
6051 ACTTTCACCA GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC AAAATGCCGC 
TGAAAGTGGT CGCAAAGACC CACTCGTTTT TGTCCTTCCG TTTTACGGCG 
6101 AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC ATACTCTTCC 
TTTTTTCCCT TATTCCCGCT GTGCCTTTAC AACTTATGAG TATGAGAAGG 
6151 TTTTTC AAT A TTATTGAAGC ATTTATCAGG GTTATTGTCT CATGAGCGGA 
AAAAAGTTAT AATAACTTCG TAAATAGTCC CAATAACAGA GTACTCGCCT 
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6201 TACATATTTG AATGTATTTA GAAAAATAAA CAAATAGGGG TTCCGCGCAC 
ATGTATAAAC TT AC AT AAAT CTTTTTATTT GTTTATCCCC AAGGCGCGTG 
6251 ATTTCCCCGA AAAGTGCCAC CTGACGTCTA AGAAACCATT ATTATCATGA 
TAAAGGGGCT TTTCACGGTG GACTGCAGAT TCTTTGGTAA TAATAGTACT 
6301 CATTAACCTA TAAAAATAGG CGTATCACGA GGCCCTTTCG TCTTCAAGAA 
GTAATTGGAT ATTTTTATCC GCATAGTGCT CCGGGAAAGC AGAAGTTCTT 
6351 TTCTCATGTT TGACAGCTTA TCATCGATAA GCTGATCCTC ACAGGCCGCA 
AAGAGTACAA AC TGTCG AAT AGTAGCTATT CGACTAGGAG TGTCCGGCGT 
6401 CCCAGCTTTT CTTCCGTTGC CCCAGTAGCA TCTCTGTCTG GTGACCTTGA 
GGGTCGAAAA GAAGGCAACG GGGTCATCGT AGAGACAGAC CACTGGAACT 
6451 AGAGGAAGAG GAGGGGTCCC GAGAATCCCC ATCCCTACCG TCCAGCAAAA 
TCTCCTTCTC CTCCCCAGGG CTCTTAGGGG TAGGGATGGC AGGTCGTTTT 
6501 AGGGGGACGA GGAATTTGAG GCCTGGCTTG AGGCTCAGGA CGCAAATCTT 
TCCCCCTGCT CCTTAAACTC CGGACCGAAC TCCGAGTCCT GCGTTTAGAA 
6551 GAGGATGTTC AGCGGGAGTT TTCCGGGCTG CGAGTAATTG GTGATGAGGA 
CTCCTACAAG TCGCCCTC AA AAGGCCCGAC GCTC ATTAAC CACTACTCCT 
6601 CGAGGATGGT TCGGAGGATG GGGAATTTTC AGACCTGGAT CTGTCTGACA 
GCTCCTACCA AGCCTCCTAC CCCTTAAAAG TCTGGACCTA GACAGACTGT 
6651 GCGACCATGA AGGGGATGAG GGTGGGGGGG CTGTTGGAGG GGGCAGGAGT 
CGCTGGTACT TCCCCTACTC CCACCCCCCC GACAACCTCC CCCGTCCTCA 
6701 CTGCACTCCC TGTATTCACT GAGCGTCGTC TAATAAAGAT GTCTATTGAT 
GACGTGAGGG ACATAAGTGA CTCGCAGCAG ATTATTTCTA CAGATAACTA 
6751 CTCTTTTAGT GTGAATCATG TCTGACGAGG GGCCAGGTAC AGGACCTGGA 
GAGAAAATCA CACTTAGTAC AGACTGCTCC CCGGTCCATG TCCTGGACCT 
6801 AATGGC CTAG GAGAGAAGGG AGACACATCT GGACCAGAAG GCTCCGGCGG 
TTACCGGATC CTCTCTTCCC TCTGTGTAGA CCTGGTCTTC CGAGGCCGCC 
6851 CAGTGGACCT CAAAGAAGAG GGGGTGATAA CCATGGACGA GGACGGGGAA 
GTCACCTGGA GTTTCTTCTC CCCCACTATT GGTACCTGCT CCTGCCCCTT 
6901 GAGGACGAGG ACGAGGAGGC GGAAGACCAG GAGCCCCGGG CGGCTCAGGA 
CTCCTGCTCC TGCTCCTCCG CCTTCTGGTC CTCGGGGCCC GCCGAGTCCT 
6951 TCAGGGCCAA GACATAGAGA TGGTGTCCGG AGACCCCAAA AACGTCCAAG 
AGTCCCGGTT CTGTATCTCT ACCACAGGCC TCTGGGGTTT TTGCAGGTTC 
7001 TTGCATTGGC TGCAAAGGGA CCCACGGTGG AACAGGAGCA GGAGCAGGAG 
AACGTAACCG ACGTTTCCCT GGGTGCCACC TTGTCCTCGT CCTCGTCCTC 
7051 CGGGAGGGGC AGGAGCAGGA GGGGCAGGAG CAGGAGGAGG GGCAGGAGCA 
GCCCTCCCCG TCCTCGTCCT CCCCGTCCTC GTCCTCCTCC CCGTCCTCGT 
7101 GGAGGAGGGG CAGGAGGGGC AGGAGGGGCA GGAGGGGCAG GAGCAGGAGG 
CCTCCTCCCC GTCCTCCCCG TCCTCCCCGT CCTCCCCGTC CTCGTCCTCC 
7151 AGGGGCAGGA GCAGGAGGAG GGGCAGGAGG GGCAGGAGGG GCAGGAGCAG 
TCCCCGTCCT CGTCCTCCTC CCCGTCCTCC CCGTCCTCCC CGTCCTCGTC 
7201 GAGGAGGGGC AGGAGCAGGA GGAGGGGCAG GAGGGGCAGG AGCAGGAGGA 
CTCCTCCCCG TCCTCGTCCT CCTCCCCGTC CTCCCCGTCC TCGTCCTCCT 
7251 GGGGCAGGAG GGGCAGGAGG GGCAGGAGCA GGAGGAGGGG CAGGAGCAGG 
CCCCGTCCTC CCCGTCCTCC CCGTCCTCGT CCTCCTCCCC GTCCTCGTCC 
7301 AGGAGGGGCA GGAGGGGCAG GAGCAGGAGG AGGGGCAGGA GGGGCAGGAG 
TCCTCCCCGT CCTCCCCGTC CTCGTCCTCC TCCCCGTCCT CCCCGTCCTC 
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7 351 GGGCAGGAGC AGGAGGAGGG GCAGGAGCAG GAGGGGCAGG AGGGGCAGGA 
CCCGTCCTCG TCCTCCTCCC CGTCCTCGTC CTCCCCGTCC TCCCCGTCCT 
7 401 GGGGCAGGAG CAGGAGGGGC AGGAGCAGGA GGAGGGGCAG GAGGGGCAGG 
CCCCGTCCTC GTCCTCCCCG TCCTCGTCCT CCTCCCCGTC CTCCCCGTCC 
7451 AGGGGCAGGA GCAGGAGGGG CAGGAGCAGG AGGGGCAGGA GCAGGAGGGG 
TCCCCGTCCT CGTCCTCCCC GTCCTCGTCC TCCCCGTCCT CGTCCTCCCC 
7 501 CAGGAGCAGG AGGGGCAGGA GGGGCAGGAG CAGGAGGGGC AGGAGGGGCA 
GTCCTCGTCC TCCCCGTCCT CCCCGTCCTC GTCCTCCCCG TCCTCCCCGT 
7 551 GGAGCAGGAG GGGCAGGAGG GGCAGGAGCA GGAGGAGGGG CAGGAGGGGC 
CCTCGTCCTC CCCGTCCTCC CCGTCCTCGT CCTCCTCCCC GTCCTCCCCG 
7 601 AGGAGCAGGA GGAGGGGCAG GAGGGGCAGG AGCAGGAGGG GCAGGAGGGG 
TCCTCGTCCT CCTCCCCGTC CTCCCCGTCC TCGTCCTCCC CGTCCTCCCC 
7 651 CAGGAGCAGG AGGGGCAGGA GGGGCAGGAG CAGGAGGGGC AGGAGGGGCA 
GTCCTCGTCC TCCCCGTCCT CCCCGTCCTC GTCCTCCCCG TCCTCCCCGT 
77 01 GGAGCAGGAG GAGGGGCAGG AGCAGGAGGG GCAGGAGCAG GAGGTGGAGG 
CCTCGTCCTC CTCCCCGTCC TCGTCCTCCC CGTCCTCGTC CTCCACCTCC 
7751 CCGGGGTCGA GGAGGCAGTG GAGGCCGGGG TCGAGGAGGT AGTGGAGGCC 
GGCCCCAGCT CCTCCGTCAC CTCCGGCCCC AGCTCCTCCA TCACCTCCGG 
7801 GGGGTCGAGG AGGTAGTGGA GGCCGCCGGG GTAGAGGACG TGAAAGAGCC 
CCCCAGCTCC TCCATCACCT CCGGCGGCCC CATCTCCTGC ACTTTCTCGG 
7851 AGGGGGGGAA GTCGTGAAAG AGCCAGGGGG AGAGGTCGTG GACGTGGAGA 
TCCCCCCCTT CAGCACTTTC TCGGTCCCCC TCTCCAGCAC CTGCACCTCT 
7901 AAAGAGGCCC AGGAGTCCCA GTAGTCAGTC ATCATCATCC GGGTCTCCAC 
TTTCTCCGGG TCCTCAGGGT CATCAGTCAG TAGTAGTAGG CCCAGAGGTG 
7 951 CGCGCAGGCC CCCTCCAGGT AGAAGGCCAT TTTTCCACCC TGTAGGGGAA 
GCGCGTCCGG GGGAGGTCCA TCTTCCGGTA AAAAGGTGGG ACATCCCCTT 
8001 GCCGATTATT TTGAATACCA C C AAGAAGGT GGCCCAGATG GTGAGCCTGA 
CGGCTAATAA AACTTATGGT GGTTCTTCCA CCGGGTCTAC CACTCGGACT 
8051 CGTGCCCCCG GGAGCGATAG AGCAGGGCCC CGCAGATGAC CCAGGAGAAG 
GCACGGGGGC CCTCGCTATC TCGTCCCGGG GCGTCTACTG GGTCCTCTTC 
8101 GCCCAAGCAC TGGACCCCGG GGTCAGGGTG ATGGAGGCAG GCGCAAAAAA 
CGGGTTCGTG ACCTGGGGCC CCAGTCCCAC TACCTCCGTC CGCGTTTTTT 
8151 GGAGGGTGGT TTGGAAAGCA TCGTGGTCAA GGAGGTTCCA ACCCGAAATT 
CCTCCCACCA AACCTTTCGT AGCACCAGTT CCTCCAAGGT TGGGCTTTAA 
8201 TGAGAACATT GCAGAAGGTT TAAGAGCTCT CCTGGCTAGG AGTCACGTAG 
ACTCTTGTAA CGTCTTCCAA ATTCTCGAGA GGACCGATCC TCAGTGCATC 
8251 AAAGGACTAC CGACGAAGGA ACTTGGGTCG CCGGTGTGTT CGTATATGGA 
TTTCCTGATG GCTGCTTCCT TGAACCCAGC GGCCACACAA GCATATACCT 
8301 GGTAGTAAGA CCTCCCTTTA CAACCTAAGG CGAGGAACTG CCCTTGCTAT 
CCATCATTCT GGAGGGAAAT GTTGGATTCC GCTCCTTGAC GGGAACGATA 
8351 TCCACAATGT CGTCTTACAC CATTGAGTCG TCTCCCCTTT GGAATGGCCC 
AGGTGTTACA GCAGAATGTG GTAACTCAGC AGAGGGGAAA CCTTACCGGG 
8401 CTGGACCCGG CCCACAACCT GGCCCGCTAA GGGAGTCCAT TGTCTGTTAT 
GACCTGGGCC GGGTGTTGGA CCGGGCGATT CCCTCAGGTA ACAGACAATA 
8451 TTC ATGGTCT TTTTACAAAC TCATATATTT GCTGAGGTTT TGAAGGATGC 
AAGTACCAGA AAAATGTTTG AGTATATAAA CGACTCCAAA ACTTCCTACG 
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8501 GATTAAGGAC CTTGTTATGA CAAAGCCCGC TCCTACCTGC AATATCAGGG 
CTAATTCCTG GAACAATACT GTTTCGGGCG AGGATGGACG TTATAGTCCC 
8551 TG AC TGTGTG CAGCTTTGAC GATGGAGTAG ATTTGCCTCC CTGGTTTC C A 
ACTGACACAC GTCGAAACTG CTACCTCATC TAAACGGAGG GACCAAAGGT 
8601 CCTATGGTGG AAGGGGCTGC CGCGGAGGGT GATGACGGAG ATGACGGAGA 
GGATACCACC TTCCCCGACG GCGCCTCCCA CTACTGCCTC TACTGCCTCT 
8651 TGAAGGAGGT GATGGAGATG AGGGTGAGGA AGGGCAGGAG TGATGTAACT 
ACTTCCTCCA CTACCTCTAC TCCCACTCCT TCCCGTCCTC ACTACATTGA 
8701 TGTTAGGAGA CGCCCTCAAT CGTATTAAAA GCCGTGTATT CCCCCGCACT 
ACAATCCTCT GCGGGAGTTA GC AT AATTTT CGGCACATAA GGGGGCGTGA 
87 51 AAAGAATAAA TCCCCAGTAG ACATCATGCG TGCTGTTGGT GTATTTCTGG 
TTTCTTATTT AGGGGTCATC TGTAGTACGC ACGACAACCA CATAAAGACC 
8801 CCATCTGTCT TGTCACCATT TTCGTCCTCC C AAC ATGGGG CAATTGGGCA 
GGTAGACAGA ACAGTGGTAA AAGCAGGAGG GTTGTACCCC GTTAACCCGT 
8851 TACCCATGTT GTCACGTCAC TCAGCTCCGC GCTCAACACC TTCTCGCGTT 
ATGGGTACAA CAGTGCAGTG AGTCGAGGCG CGAGTTGTGG AAGAGCGCAA 
8901 GGAAAACATT AGCGACATTT ACCTGGTGAG CAATCAGACA TGCGACGGCT 
CCTTTTGTAA TCGCTGTAAA TGGACCACTC GTTAGTCTGT ACGCTGCCGA 
8951 TTAGCCTGGC CTCCTTAAAT TCACCTAAGA ATGGGAGCAA CCAGCATGCA 
AATCGGACCG GAGGAATTTA AGTGGATTCT TACCCTCGTT GGTCGTACGT 
9001 GGAAAAGGAC AAGCAGCGAA AATTCACGCC CCCTTGGGAG GTGGCGGCAT 
CCTTTTCCTG TTCGTCGCTT TTAAGTGCGG GGGAACCCTC CACCGCCGTA 
9051 ATGCAAAGGA TAGCACTCCC ACTCTACTAC TGGGTATCAT ATGCTGACTG 
TACGTTTCCT ATCGTGAGGG TGAGATGATG ACCCATAGTA TACGACTGAC 
9101 TATATGCATG AGGATAGCAT ATGCTACCCG GATACAGATT AGGATAGCAT 
ATATACGTAC TCCTATCGTA TACGATGGGC CTATGTCTAA TCCTATCGTA 
9151 ATACTACCCA GATATAGATT AGGATAGCAT ATGCTACCCA GATATAGATT 
TATGATGGGT CTATATCTAA TCCTATCGTA TACGATGGGT CTATATCTAA 
9201 AGGATAGCCT ATGCTACCCA GATATAAATT AGGATAGCAT ATACTACCCA 
TCCTATCGGA TACGATGGGT CTATATTTAA TCCTATCGTA TATGATGGGT 
9251 GATATAGATT AGGATAGCAT ATGCTACCCA GATATAGATT AGGATAGCCT 
CTATATCTAA TCCTATCGTA TACGATGGGT CTATATCTAA TCCTATCGGA 
9301 ATGCTACCCA GATATAGATT AGGATAGCAT ATGCTACCCA GATATAGATT 
TACGATGGGT CTATATCTAA TCCTATCGTA TACGATGGGT CTATATCTAA 
9351 AGGATAGCAT ATGCTATCCA GATATTTGGG TAGTATATGC TACCCAGATA 
TCCTATCGTA TACGATAGGT CTATAAACCC ATCATATACG ATGGGTCTAT 
9401 TAAATTAGGA T AGC AT AT AC TACCCTAATC TCTATTAGGA TAGCATATGC 
ATTTAATCCT ATCGTATATG ATGGGATTAG AGATAATCCT ATCGTAT AC G 
9451 TACCCGGATA CAGATTAGGA TAGCATATAC TACCCAGATA TAGATTAGGA 
ATGGGCCTAT GTCTAATCCT ATCGTATATG ATGGGTCTAT ATCTAATCCT 
9 501 TAGCATATGC TACCCAGATA TAGATTAGGA TAGCCTATGC TACCCAGATA 
ATCGTAT ACG ATGGGTCTAT ATCTAATCCT ATCGGATACG ATGGGTCTAT 
9551 TAAATTAGGA TAGCATATAC TACCCAGATA TAGATTAGGA TAGCATATGC 
ATTTAATCCT ATCGTATATG ATGGGTCTAT ATCTAATCCT ATCGTAT ACG 
9601 TACCCAGATA TAGATTAGGA TAGCCTATGC TACCCAGATA TAGATTAGGA 
ATGGGTCTAT ATCTAATCCT ATCGGATACG ATGGGTCTAT ATCTAATCCT 
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9651 TAGCATATGC TATCCAGATA TTTGGGTAGT ATATGCTACC CATGGCAACA 
ATCGTATACG ATAGGTCTAT AAACCCATCA TATACGATGG GTACCGTTGT 
9701 TTAGCCCACC GTGCTCTCAG CGACCTCGTG AATATGAGGA CCAACAACCC 
AATCGGGTGG CACGAGAGTC GCTGGAGCAC TTATACTCCT GGTTGTTGGG 
97 51 TGTGCTTGGC GCTCAGGCGC AAGTGTGTGT AATTTGTCCT CCAGATCGCA 
ACACGAACCG CGAGTCCGCG TTCACACACA TTAAACAGGA GGTCTAGCGT 
9801 GCAATCGCGC CCCTATCTTG GCCCGCCCAC CTACTTATGC AGGTATTCCC 
CGTTAGCGCG GGGATAGAAC CGGGCGGGTG GATGAATACG TCCATAAGGG 
9851 CGGGGTGCCA TTAGTGGTTT TGTGGGCAAG TGGTTTGACC GCAGTGGTTA 
GCCCCACGGT AATCACCAAA ACACCCGTTC ACCAAACTGG CGTCACCAAT 
9901 GCGGGGTTAC AATCAGCCAA GTTATTACAC CCTTATTTTA CAGTCCAAAA 
CGCCCCAATG TTAGTCGGTT CAATAATGTG GGAATAAAAT GTCAGGTTTT 
9951 CCGCAGGGCG GCGTGTGGGG GCTGACGCGT GCCCCCACTC CACAATTTCA 
GGCGTCCCGC CGCACACCCC CGACTGCGCA CGGGGGTGAG GTGTTAAAGT 
10001 AAAAAAAGAG TGGCCACTTG TCTTTGTTTA TGGGCCCCAT TGGCGTGGAG 
TTTTTTTCTC ACCGGTGAAC AGAAACAAAT ACCCGGGGTA ACCGCACCTC 
10051 CCCCGTTTAA TTTTCGGGGG TGTTAGAGAC AACCAGTGGA GTCCGCTGCT 
GGGGCAAATT AAAAGCCCCC ACAATCTCTG TTGGTCACCT CAGGCGACGA 
10101 GTCGGCGTCC ACTCTCTTTC CCCTTGTTAC AAATAGAGTG TAACAACATG 
CAGCCGCAGG TGAGAGAAAG GGGAACAATG TTTATCTCAC ATTGTTGTAC 
10151 GTTCACCTGT CTTGGTCCCT GCCTGGGACA CATCTTAATA ACCCCAGTAT 
CAAGTGGACA GAACCAGGGA CGGACCCTGT GTAGAATTAT TGGGGTCATA 
10201 CATATTGCAC TAGGATTATG TGTTGCCCAT AGCCATAAAT TCGTGTGAGA 
GTATAACGTG ATCCTAATAC ACAACGGGTA TCGGTATTTA AGCACACTCT 
10251 TGGACATCCA GTCTTTACGG CTTGTCCCCA CCCCATGGAT TTCTATTGTT 
ACCTGTAGGT CAGAAATGCC GAACAGGGGT GGGGTACCTA AAGATAACAA 
10301 AAAGATATTC AGAATGTTTC ATTCCTACAC TAGTATTTAT TGCCCAAGGG 
TTTCTATAAG TCTTACAAAG TAAGGATGTG ATCATAAATA ACGGGTTCCC 
10351 GTTTGTGAGG GTTATATTGG TGTCATAGCA CAATGCCACC ACTGAACCCC 
CAAACACTCC CAATATAACC ACAGTATCGT GTTACGGTGG TGACTTGGGG 
10401 CCGTCCAAAT TTTATTCTGG GGGCGTCACC TGAAACCTTG TTTTCGAGCA 
GGCAGGTTTA AAATAAGACC CCCGCAGTGG ACTTTGGAAC AAAAGCTCGT 
10451 CCTCACATAC AC CTTACTGT TCACAACTCA GCAGTTATTC TATTAGCTAA 
GGAGTGTATG TGGAATGACA AGTGTTGAGT CGTCAATAAG ATAATCGATT 
10501 ACGAAGGAGA ATGAAGAAGC AGGCGAAGAT TCAGGAGAGT TCACTGCCCG 
TGCTTCCTCT TACTTCTTCG TCCGCTTCTA AGTCCTCTCA AGTGACGGGC 
10551 CTCCTTGATC TTCAGCCACT GCCCTTGTGA CTAAAATGGT TCACTACCCT 
GAGGAACTAG AAGTCGGTGA CGGGAACACT GATTTTACCA AGTGATGGGA 
10601 CGTGGAATCC TGACCCCATG TAAATAAAAC CGTGACAGCT CATGGGGTGG 
GCACCTTAGG ACTGGGGTAC ATTTATTTTG GCACTGTCGA GTACCCCACC 
10651 GAGATATCGC TGTTCCTTAG GACCCTTTTA CTAACCCTAA TTCGATAGCA 
CTCTATAGCG ACAAGGAATC CTGGGAAAAT GATTGGGATT AAGCTATCGT 
10701 TATGCTTCCC GTTGGGTAAC ATATGCTATT GAATTAGGGT TAGTCTGGAT 
ATACGAAGGG CAACCCATTG TATACGATAA CTTAATCCCA ATC AG AC CT A 
10751 AGTATATACT ACTACCCGGG AAGCATATGC TACCCGTTTA GGGTTAACAA 
TCATATATGA TGATGGGCCC TTCGTATACG ATGGGCAAAT C C C AATTGTT 



26/51 



FIG. 10-1 1 



pCEP4W/hepEK 



10801 GGGGGCCTTA TAAACACTAT TGCTAATGCC CTCTTGAGGG TCCGCTTATC 
CCCCCGGAAT ATTTGTG ATA ACGATTACGG GAGAACTCCC AGGCGAATAG 
10851 GGTAGCTACA CAGGCCCCTC TGATTGACGT TGGTGTAGCC TCCCGTAGTC 
CCATCGATGT GTCCGGGGAG ACTAACTGCA ACCACATCGG AGGGCATCAG 
10901 TTCCTGGGCC CCTGGGAGGT ACATGTCCCC CAGCATTGGT GTAAGAGCTT 
AAGGACCCGG GGACCCTCCA TGTACAGGGG GTCGTAACCA CATTCTCGAA 
10951 CAGCCAAGAG TTACACATAA AGGCAATGTT GTGTTGCAGT CCACAGACTG 

GTCGGTTCTC AATGTGTATT TCCGTTACAA CACAACGTCA GGTGTCTGAC - 
11001 CAAAGTCTGC TCCAGGATGA AAGCCACTCA GTGTTGGCAA ATGTGCACAT 
GTTTCAGACG AGGTCCTACT TTCGGTGAGT CACAACCGTT TACACGTGTA 
11051 CCATTTATAA GGATGTCAAC TACAGTCAGA GAACCCCTTT GTGTTTGGTC 
GGTAAATATT CCTACAGTTG ATGTCAGTCT CTTGGGGAAA CACAAACCAG 
11101 CCCCCCCGTG TCACATGTGG AACAGGGCCC AGTTGGCAAG TTGTACCAAC 
GGGGGGGCAC AGTGTACACC TTGTCCCGGG TCAACCGTTC AACATGGTTG 
11151 CAACTGAAGG GATTACATGC ACTGCCCCGC GAAGAAGGGG CAGAGATGCC 
GTTGACTTCC CTAATGTACG TGACGGGGCG CTTCTTCCCC GTCTCTACGG 
11201 GTAGTCAGGT TTAGTTCGTC CGGCGGCGGG GCTCTAGAGT CGACCGGTCA 
CATCAGTCCA AATCAAGCAG GCCGCCGCCC CGAGATCTCA GCTGGCCAGT 
11251 TGGCTGCGCC CCGACACCCG CCAACACCCG CTGACGCGCC CTGACGGGCT 
ACCGACGCGG GGCTGTGGGC GGTTGTGGGC GACTGCGCGG GACTGCCCGA 
11301 TGTCTGCTCC CGGCATCCGC TT AC AG AC AA GCTGTGACCG TCTCCGGGAG 
ACAGACGAGG GCCGTAGGCG AATGTCTGTT CGACACTGGC AGAGGCCCTC 
11351 CTGCATGTGT CAGAGGTTTT CACCGTCATC ACCGAAACGC GCGAGGCAGC 
GACGTACACA GTCTCCAAAA GTGGCAGTAG TGGCTTTGCG CGCTCCGTCG 
11401 CGGATCATAA TCAGCCATAC CACATTTGTA GAGGTTTTAC TTGCTTTAAA 
GCCTAGTATT AGTCGGTATG GTGTAAACAT CTCCAAAATG AACGAAATTT 
11451 AAACCTCCCC ACCTCCCCCT GAACCTGAAA CATAAAATGA ATGCAATTGT 
TTTGGAGGGG TGGAGGGGGA CTTGGACTTT GTATTTTACT TACGTTAACA 
; 11501 TGTTGTTAAC TTGTTTATTG CAGCTTATAA TGGTTACAAA TAAAGCAATA 
ACAACAATTG AACAAATAAC GTCGAATATT ACCAATGTTT ATTTCGTTAT 
11551 GCATCACAAA TTTCACAAAT AAAGCATTTT TTTCACTGCA TTCTAGTTGT 
CGTAGTGTTT AAAGTGTTTA TTTCGTAAAA AAAGTGACGT AAGATCAACA 
11601 GGTTTGTCCA AACTCATCAA TGTATCTTAT CATGTCTGGA TCCCACGTGC 
CCAAACAGGT TTGAGTAGTT ACATAGAATA GTACAGACCT AGGGTGCACG 
11651 AGGCGGGGAG GCGGCCCAAA GGGAGATCCG ACTCGTCTGA GGGCGAAGGC 
TCCGCCCCTC CGCCGGGTTT CCCTCTAGGC TGAGCAGACT CCCGCTTCCG 
11701 GAAGACGCGG AAGAGGCCGC AGAGCCGGCA GCAGGCCGCG GGAAGGAAGG 
CTTCTGCGCC TTCTCCGGCG TCTCGGCCGT CGTCCGGCGC CCTTCCTTCC 
117 51 TCCGCTGGAT TGAGGGCCGA AGGGACGTAG CAGAAGGACG TCCCGCGCAG 
AGGCGACCTA ACTCCCGGCT TCCCTGCATC GTCTTCCTGC AGGGCGCGTC 
11801 AATCCAGGTG GCAACACAGG CGAGCAGCCA AGGAAAGGAC GATGATTTCC 
TTAGGTCCAC CGTTGTGTCC GCTCGTCGGT TCCTTTCCTG CTACTAAAGG 
11851 CCGACAACAC CACGGAATTG TCAGTGCCCA ACAGCCGAGC CCCTGTCCAG 
GGCTGTTGTG GTGCCTTAAC AGTCACGGGT TGTCGGCTCG GGGACAGGTC 
11901 CAGCGGGCAA GGCAGGCGGC GATGAGTTCC GCCGTGGCAA TAGGGAGGGG 
GTCGCCCGTT CCGTCCGCCG CTACTCAAGG CGGCACCGTT ATCCCTCCCC 
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11951 GAAAGCGAAA GTCCCGGAAA GGAGCTGACA GGTGGTGGCA ATGCCCCAAC 

CTTTCGCTTT CAGGGCCTTT CCTCGACTGT CCACCACCGT TACGGGGTTG 
12001 CAGTGGGGGT TGCGTCAGCA AACACAGTGC ACACCACGCC ACGTTGCCTG 

GTCACCCCCA ACGCAGTCGT TTGTGTCACG TGTGGTGCGG TGCAACGGAC 
12051 ACAACGGGCC ACAACTCCTC ATAAAGAGAC AGCAACCAGG ATTTATACAA 

TGTTGCCCGG TGTTGAGGAG TATTTCTCTG TCGTTGGTCC TAAATATGTT 
12101 GGAGGAGAAA ATGAAAGCCA TACGGGAAGC AATAGCATGA T AC AAAGGC A 

CCTCCTCTTT TACTTTCGGT ATGCCCTTCG TTATCGTACT ATGTTTCCGT 
12151 TTAAAGCAGC GTATCCACAT AGCGTAAAAG GAGCAACATA GTTAAGAATA 

AATTTCGTCG CATAGGTGTA TCGCATTTTC CTCGTTGTAT CAATTCTTAT 
12201 CCAGTCAATC TTTCACAAAT TTTGTAATCC AGAGGTTGAT TC 

GGTCAGTTAG AAAGTGTTTA AAACATTAGG TCTCCAACTA AG 



28/51 



FIG. 1 1— I 

pCEP4W/hep36 

1 TCGAGCGGCC GCTTTAAACT CAATGGTGAT GGTGATGATG ACCGGTACGC 
* AGCTCGCCGG CGAAATTTGA G TT AC C ACTA CCACTACTAC TGGCCATGCG 
-3 HHHHHHGTRT 
51 GTAGAATCGA GACCGAGGAG AGGGTTAGGG ATAGGCTTAC CGAATTCGAG 
CATCTTAGCT CTGGCTCCTC TCCCAATCCC TATCCGAATG GCTTAAGCTC 
-3 -TSD L GL L PNP I PK G F E L 
101 CTGGGTCACC ATGCCGCTGG CTTCGGAGTG AGTCTTTATG GCCTGGAAGA 
GACCCAGTGG TACGGCGACC GAAGCCTCAC TCAGAAATAC CGGACCTTCT 
-3 QTV MGS A ES H TKI A Q F I 
151 TCCACTCCCG GAAGTCACTG ACTTTGGTGT AGACGCCTGG CTTCTGGGCC 
AGGTGAGGGC CTTCAGTGAC TGAAACCACA TCTGCGGACC GAAGACCCGG 
-3 -IWE R FDS V KT Y VG P KQAL 
201 AGGGCACAGC CAGTGCCCCA ACTCACAATG CCACACAGCC GCCAACGTGG 
TCCCGTGTCG GTCACGGGGT TGAGTGTTAC GGTG TGTCGG CGGTTGCACC 
-3 LAC G TG W SVI GCL RWRP 
251 CGTCCGAGAG ATGCTGTCCT CACACACAAA GGGACCACCG CTGTCGCCCT 
GCAGGCTCTC TACGACAGGA GTGTGTGTTT CCCTGGTGGC GACAGCGGGA 
-3 TRS I SDECVFPGGSDGQ- 
301 GGCAGGCATC AATGCCACCC TCGGGGTAGC CAGCACAGAA CATCTTGGGC 
CCGTCCGTAG TTACGGTGGG AGCCCCATCG GTCGTGTCTT GTAGAACCCG 
-3 • Q C A D IGG E PY G A C F MKPK 
351 TTGATCTGGT TTCCATAGAA ATCAGCGCCA TTGCAGACAT CATTGCTGAT 
AACTAGACCA AAGGTATCTT TAGTCGC GGT AACGTCTGTA GTAACGACTA 
-3 -KIQ N GY F DA G N CV D NS I 
401 T ATGGGG AC T CGAGCCTCCT GGAGTACGCC GGCCTGTTGG CCATAGTACT 
ATACCCCTGA GCTCGGAGGA CCTCATGGGG CCGGACAAC C GGTATCATGA 
-3 IPV R AEQ L V. G AQQG YYQ 
451 GCGTGTTGCC CCAGCCCGTC ACGGTACAGA TCTTGCCATC CACCAGGGCC 
CGCACAACGG GGTCGGGCAG TGCCATGTCT AGAACGGTAG GTGGTCCCGG 
-3 • Q T N GWGT V TC t KG D VLAQ 
501 TGGCCGGCAG CTGGGAGGCA CACAGGCTGG ATGTATTCTG TGAGGGGCAG 
ACCGGCCGTC GACCCTCCGT GTGTCCGACC TACATAAGAC ACTCCCCGTC 
-3 -QGA A PL C VPQ » YE T LPL 
551 GGGACTGGAG AGGTGGACCA GGGCAATATC GTTGCTGTTC TCCTCGCTGT 
CCCTGACCTC TCCACCTGGT CCCGTTATAG CAACGACAAG AGGAGCGACA 
-3 PSSLHVLAIDNSNEESN- 
601 TGGGGTCCCG AAAGGGAAGA TAGCCCCCGT GGTAGACCAC AGCCTGCACC 
ACCCCAGGGC TTTCCCTTCT ATCGGGGGCA CCATCTGGTG TCG GACGTGG 
-3 • N P D R FPL Y GG H YV V AQVG 
651 CCCAGCTGCA GACCGTGGGG AGAGGCCTGG GCCACGGCAC CGGCAAACAC 
GGGTCGACGT CTGGCACCCC TCTCCGGACC CGGT GCCGTG GCCGTTTGTG 
-3 GLQ L GH P SAG A VA G A F V 
7 01 TCGCCATCGG GACAGGACCC GGTTCCGCTC CGGGAAGCAG TGGGCGGCTG 
AGCGGTAGCC CTGTCCTGGG CCAAGGCGAG GCCCTTCGTC ACCCGCCGAC 



-3 



RWR S LV R NR E PFC H A AT* 
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FIG. 11-2 

pCEP4W/hep36 

7 51 TCAGCACCCA GTCCCCGGAG AGCAGGGATC CCCCACAGAG GTGTGCTCCA . 
AGTCGTGGGT CAGGGGCCTC TCGTC C C TAG GGGGTGTCTC CACACGAGGT 
-3 TLVWDGS L LSGGC L HAGD 
801 TCATAGCGAA GGCTGACTTG CCACGGCCAC CGGCCCAAGC TGGTGTCCCG 
AGTATCGCTT CCGACTGAAC GGTGCCGGTG GCCGGGTTCG ACC AC AGGGC 
-3 DYR L SV QWPWR GL S T DR 
851 GCCTCCCACG ATCTTGTCGT CGTCGTCCAC GGGCAGCTTC CTGCGGCCAC 
CGGAGGGTGC TAGAACAGCA GCAGCAGGTG CCCGTCGAAG GACGCCGGTG 
-3 GGV I KDDDDVPLKRRGC 
901 AGTCTTGGTC CGGAGCGTCA CCAGTGGAAC CTGGAACCCA GAGCAGCAGT 
TCAGAACCAG GCCTCGC AGT GGTCACCTTG GACCTTGGGT CTCGTCGTCA 
-3 C D Q D PAD G TSG PVW LLLV 
951 ACCCATAGCA GGAGTGTGTC TGTCTCCATG GTGGCGATCT GGTACCCAGC 
TGGGTATCGT CCTCACACAG AC AG AGGT A C CACCGCTAGA CCATGGGTCG 
-3 VWL L LT D TEM 
1001 TTCTAGAGAT CTGACGGTTC ACTAAACGAG CTCTGCTTAT ATAGACCTCC 
AAGATCTCTA GACTGCCAAG TGATTTGCTC GAGACGAATA TATCTGGAGG 
1051 CACCGTACAC GCCTACCGCC CATTTGCGTC AACGGGGCGG GGTTATTACG 
GTGGCATGTG CGGATGGCGG GTAAACGCAG TTGCCCCGCC CCAATAATGC 
1101 ACATTTTGGA AAGTCCCGTT GATTTTGGTG CCAAAACAAA CTCCCATTGA 
TGTAAAACCT TTCAGGGCAA CTAAAACCAC GGTTTTGTTT GAGGGTAACT 
1151 CGTCAATGGG GTGGAGACTT GGAAATCCCC GTGAGTCAAA CCGCTATCCA 
GCAGTTACCC CACCTCTGAA CCTTTAGGGG CACTCAGTTT GGCGATAGGT 
1201 CGCCCATTGG TGTACTGCCA AAACCGCATC ACCATGGTAA TAGCGATGAC 
GCGGGTAACC ACATGACGGT TTTGGCGTAG TGGTACCATT ATCGCTACTG 
1251 TAATACGTAG ATGTACTGCC AAGTAGGAAA GTCCCGTAAG GTCATGTACT 
ATT ATG CATC TACATGACGG TTCATCCTTT CAGGGCATTC CAGTACATGA 
1301 GGGCATAATG CCAGGCGGGC CATTTACCGT CATTGACGTC AATAGGGGGC 
CCCGTATTAC GGTCCGCCCG GTAAATGGCA GTAACTGCAG TTATCCCCCG 
1351 GGACTTGGCA TATGATACAC TTGATGTACT GCCAAGTGGG CAGTTTACCG 
CCTGAACCGT ATACTATGTG AACTACATGA CGGTTCACCC GTCAAATGGC 
1401 TAAATACTCC ACCCATTGAC GTCAATGGAA AGTCCCTATT GGCGTTACTA 
ATTTATGAGG TGGGTAACTG CAGTTACCTT TCAGGGATAA CCGCAATGAT 
1451 TGGGAACATA CGTCATTATT GACGTCAATG GGCGGGGGTC GTTGGGCGGT 
ACCCTTGTAT GCAGTAATAA CTGCAGTTAC CCGCCCCCAG CAACCCGCCA 
1501 CAGCCAGGCG GGCCATTTAC CGTAAGTTAT GTAACGCGGA ACTCCATATA 
GTCGGTCCGC CCGGTAAATG GCATTCAATA CATTGCGCCT TGAGGTATAT 
1551 TGGGCTATGA ACTAATG AC C CCGTAATTGA TTACTATTAA TAACTAGTCA 
ACCCGATACT TGATTACTGG GGCATTAACT AATGATAATT ATTGATCAGT 
1601 ATAATCAATG TCAACATGGC GGTCATATTG G AC ATGAGC C AATATAAATG 
TATTAGTTAC AGTTGTACCG CCAGTATAAC CTGTACTCGG TTATATTTAC 
1651 TACATATTAT GATATAGATA CAACGTATGC AATGGCCAAT AGCCAATATT 
ATGTATAATA CTATATCTAT GTTGCATACG TTACCGGTTA TCGGTTATAA 
17 01 GATTTATGCT ATATAACCAA TGACTAATAT GGCTAATTGC CAATATTGAT 
CTAAATACGA TATATTGGTT ACTGATTATA CCGATTAACG GTTATAACTA 
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TCAATGTATA GATCTTCCAT ACCTACCAGT TCTGCGCCTG CAGCAATGCA 
AGTTACATAT CTAGAAGGTA TGGATGGTCA AGACGCGGAC GTCGTTACGT 
ACAACGTTGC CCGGATCTGC GATGATAAGC TGTCAAACAT GAGAATTGGT 
TGTTGCAACG GGCCTAGACG CTACTATTCG ACAGTTTGTA CTCTTAACCA 
CGACTAGCTT GGCACGCCAG AAATCCGCGC GGTGGTTTTT GGGGGTCGGG 
GCTGATCGAA CCGTGCGGTC TTTAGGCGCG CCACCAAAAA CCCCCAGCCC 
GGTGTTTGGC AGCCACAGAC GCCCGGTGTT CGTGTCGCGC CAGTACATGC 
CCACAAACCG TCGGTGTCTG CGGGCCACAA GCACAGCGCG GTCATGTACG 
GGTCCATGCC CAGGCCATCC AAAAACCATG GGTCTGTCTG CTCAGTCCAG 
CCAGGTACGG GTCCGGTAGG TTTTTGGTAC CCAGACAGAC GAGTCAGGTC 
TCGTGGACCA GACCCCACGC AACGCCCAAA ATAATAACCC CCACGAACCA 
AGCACCTGGT CTGGGGTGCG TTGCGGGTTT TATTATTGGG GGTGCTTGGT 
TAAACCATTC CCCATGGGGG ACCCCGTCCC TAACCCACGG GGCCAGTGGC 
ATTTGGTAAG GGGTACCCCC TGGGGCAGGG ATTGGGTGCC CCGGTCACCG 
TATGGCAGGG CCTGCCGCCC CGACGTTGGC TGCGAGCCCT GGGCCTTCAC 
ATACCGTCCC GGACGGCGGG GCTGCAACCG ACGCTCGGGA CCCGGAAGTG 
CCGAACTTGG GGGGTGGGGT GGGGAAAAGG AAGAAACGCG GGCGTATTGG 
GGCTTGAACC CCCCACCCCA CCCCTTTTCC TTCTTTGCGC CCGCATAACC 
CCCCAATGGG GTCTCGGTGG GGT ATCG AC A GAGTGCCAGC CCTGGGACCG 
GGGGTTACCC CAGAGCCACC CCATAGCTGT CTCACGGTCG GGACCCTGGC 
AACCCCGCGT TTATGAACAA ACGACCCAAC ACCCGTGCGT TTTATTCTGT 
TTGGGGCGCA AATACTTGTT TGCTGGGTTG TGGGCACGCA AAATAAGACA 
CTTTTTATTG CCGTCATAGC GCGGGTTCCT TCCGGTATTG TCTCCTTCCG 
GAAAAATAAC GGCAGTATCG CGCCCAAGGA AGGCCATAAC AGAGGAAGGC 
TGTTTCAGTT AGCCTCCCCC ATCTCCCCTA TTCCTTTGCC CTCGGACGAG 
ACAAAGTCAA TCGGAGGGGG TAGAGGGGAT AAGGAAACGG GAGCCTGCTC 
TGCTGGGGCG TCGGTTTCCA CTATCGGCGA GTACTTCTAC ACAGCCATCG 
ACGACCCCGC AGCCAAAGGT GATAGCCGCT CATGAAGATG TGTCGGTAGC 
GTCCAGACGG CCGCGCTTCT GCGGGCGATT TGTGTACGCC CGACAGTCCC 
CAGGTCTGCC GGCGCGAAGA CGCCCGCTAA ACACATGCGG GCTGTCAGGG 
GGCTCCGGAT CGGACGATTG CGTCGCATCG ACCCTGCGCC CAAGCTGCAT 
CCGAGGCCTA GCCTGCTAAC GCAGCGTAGC TGGGACGCGG GTTCGACGTA 
CATCGAAATT GCCGTCAACC AAGCTCTGAT AGAGTTGGTC AAGACCAATG 
GTAGCTTTAA CGGCAGTTGG TTCGAGACTA TCTCAACCAG TTCTGGTTAC 
CGGAGCATAT ACGCCCGGAG CCGCGGCGAT CCTGCAAGCT CCGGATGCCT 
GCCTCGTATA TGCGGGCCTC GGCGCCGCTA GGACGTTCGA GGCCTACGGA 
CCGCTCGAAG TAGCGCGTCT GCTGCTCCAT ACAAGCCAAC CACGGCCTCC 
GGCGAGCTTC ATCGCGCAGA CGACGAGGTA TGTTCGGTTG GTGCCGGAGG 
AGAAGAAGAT GTTGGCGACC TCGTATTGGG AATCCCCGAA CATCGCCTCG 
TCTTCTTCTA CAACCGCTGG AGCATAACCC TTAGGGGCTT GTAGCGGAGC 
CTCCAGTCAA TGACCGCTGT TATGCGGCCA TTGTCCGTCA GGACATTGTT 
GAGGTCAGTT ACTGGCGACA ATACGCCGGT AACAGGCAGT CCTGTAACAA 
GGAGCCGAAA TCCGCGTGCA CGAGGTGCCG GACTTCGGGG CAGTCCTCGG 
CCTCGGCTTT AGGCGCACGT GCTCCACGGC CTGAAGCCCC GTCAGGAGCC 
CCCAAAGCAT CAGCTCATCG AGAGCCTGCG CGACGGACGC ACTGACGGTG 
GGGTTTCGTA GTCGAGTAGC TCTCGGACGC GCTGCCTGCG TGACTGCCAC 



31/51 



FIG. 1 1-4 



pCEP4W/hep36 



2901 

2951 

3001 

3051 

3101 

3151 

3201 

3251 

3301 

3351 

3401 

3451 

3501 

3551 

3601 

3651 

3701 

3751 

3801 

3851 

3901 

3951 

4001 



TCGTCCATCA 
AGCAGGTAGT 
TATGAAATCA 
ATACTTTAGT 

GGCCGAACCC 
CCGGCTTGGG 
GCCTCCGCGA 
CGGAGGCGCT 
TTGCAACGTG 
AACGTTGCAC 
CGCTGAATTC 
GCGACTTAAG 
GCAAAGTGCC 
CGTTTCACGG 
ATTTACCCGC 
TAAATGGGCG 
GAGATTCTTC 
CTCTAAGAAG 
TTTTCGATCA 
AAAAGCTAGT 
CATATCTCAT 
GTATAGAGTA 
GTCGCTGCAG 
CAGCGACGTC 
GCGAAGTGGA 
CGCTTCACCT 
TTCGCCAATG 
AAGCGGTTAC 
GACATGCAAA 
CTGTACGTTT 
GGGCCACGGG 
CCCGGTGCCC 
TTGCTGGCGT 
AACGACCGCA 

TCTCGCTTCC 
AGAGCGAAGG 
GGCAGGTAGA 
CCGTCCATCT 
CTTACCAGCC 
GAATGGTCGG 
TGCCGCCTCG 
ACGGCGGAGC 
CCCTATACCT 
GGGATATGGA 

GCCACCTCGA 
CGGTGGAGCT 



CAGTTTGCCA 

GTCAAACGGT 

CGCCATGTAG 

GCGGTACATC 

GCTCGTCTGG 

CGAGCAGACC 

CCGGCTGCAG 

GGCCGACGTC 

ACACCCTGTG 

TGTGGGACAC 

CCCAATGTCA 

GGGTTACAGT 

GATAAACATA 

CTATTTGTAT 

AGGACATATC 

TCCTGTATAG 

GCCCTCCGAG 

CGGGAGGCTC 

GAAACTTCTC 

CTTTGAAGAG 

TGCCCGGGAT 

ACGGGCCCTA 

GGTCGCTCGG 

CCAGCGAGCC 

CCTGGGACCG 

GGACCCTGGC 

ACAAGACGCT 

TGTTCTGCGA 

TATATTTCTT 

ATATAAAGAA 

GATGAAGCAG 

CTACTTCGTC 

TCGCGACGCG 

AGCGCTGCGC 

GGCGGCATCG 

CCGCCGTAGC 

TGAGGACCAT 

ACTGCTGGTA 

TAACTTCGAT 

ATTGAAGCTA 

GCGAGCACAT 

CGCTCGTGTA 

TGTCTGCCTC 

ACAGACGGAG 

CCTGAATGGA 

GGACTTACCT 



GTGATACACA 
CACTATGTGT 

TGTATTGACC 

ACATAACTGG 

CTAAGATCGG 

GATTCTAGCC 

AACAGCGGGC 

TTGTCGCCCG 

CACGGCGGGA 

GTGCCGCCCT 

AGCACTTCCG 

TCGTGAAGGC 

ACGATCTTTG 

TGCTAGAAAC 

CACGCCCTCC 

GTGCGGGAGG 

AGCTGCATCA 

TCGACGTAGT 

GACAGACGTC 

CTGTCTGCAG 

CTGCGGCACG 

GACGCCGTGC 

TGTTCGAGGC 

ACAAGCTCCG 

CGCCGCCCCG 

GCGGCGGGGC 

GGGCGGGGTT 

CCCGCCCCAA 

CCGGGGACAC 

GGCCCCTGTG 

GGCATGGCGG 

CCGTACCGCC 

AGGCTGGATG 

TCCGACCTAC 

GGATGCCCGC 

CCTACGGGCG 

CAGGGACAGC 

GTCCCTGTCG 

CACTGGACCG 

GTGACCTGGC 

GGAACGGGTT 

CCTTGCCCAA 

CCCGCGTTGC 

GGGCGCAACG 

AGCCGGCGGC 

TCGGCCGCCG 



TGGGGATCAG 
ACCCCTAGTC 
GATTCCTTGC 
CTAAGGAACG 
CCGCAGCGAT 
GGCGTCGCTA 
AGTTCGGTTT 
TCAAGCCAAA 
GATGCAATAG 
CTACGTTATC 
GAATCGGGAG 
CTTAGCCCTC 
TAGAAACCAT 
ATCTTTGGTA 
TACATCGAAG 
ATGTAGCTTC 

GGTCGGAGAC 
CCAGCCTCTG 

GCGGTGAGTT 
CGCCACTCAA 
CTGTTGACGC 
GACAACTGCG 
CACACGCGTC 
GTGTGCGCAG 
ACTGCATCTG 
TGACGTAGAC 
TGTGTCATCA 
ACACAGTAGT 
CGCCAGCAAA 
GCGGTCGTTT 
CCGACGCGCT 
GGCTGCGCGA 
GCCTTCCCCA 
CGGAAGGGGT 
GTTGCAGGCC 
CAACGTCCGG 
TTCAAGGATC 
AAGTTCCTAG 
CTGATCGTCA 
GACTAGCAGT 
GiGCATGGATT 
CCGTACCTAA 
GTCGCGGTGC 
CAGCGCCACG 
ACCTCGCTAA 
TGGAGCGATT 



CAATCGCGCA 
GTTAGCGCGT 

GGTCCGAATG 
CCAGGCTTAC 

CGCATCCATG 
GCGTAGGTAC 
C^GGCAGGTC 
GTCCGTCCAG 

GTCAGGCTCT 
CAGTCCGAGA 

CGCGGCCGAT 
GCGCCGGCTA 
CGGCGCAGCT 
GCCGCGTCGA 
CTGAAAGCAC 
GACTTTCGTG 
GCTGTCGAAC 
CGACAGCTTG 
CAGGCTTTTT 
GTCCGAAAAA 
TGTTAAGCGG 
ACAATTCGCC 
ACCTTAATAT 
TGGAATTATA 
CGTGTTCGAA 
GCACAAGCTT 
TAGAACTAAA 
ATCTTGATTT 
CGCGAGCAAC 
GCGCTCGTTG 
GGGCTACGTC 
CCCGATGCAG 
TTATGATTCT 
AATACTAAGA 
ATGCTGTCCA 
TACGACAGGT 
GCTCGCGGCT 
CGAGCGCCGA 
CGGCGATTTA 
GCCGCTAAAT 
GTAGGCGCCG 
CATCCGCGGC 
ATGGAGCCGG 
TACCTCGGCC 
CGGATTCACC 
GCCTAAGTGG 
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ACTCCAAGAA TTGGAGCCAA 
TGAGGTTCTT AACCTCGGTT 
AACCAACCCT TGGCAGAACA 
TTGGTTGGGA ACCGTCTTGT 



GCACGCGGCG 
CGTGCGCCGC 
CGTTTTTCCA 
GCAAAAAGGT 
TCAAGTCAGA 
AGTTCAGTCT 
TCCCCCTGGA 
AGGGGGACCT 
CCGGATACCT 
GGCCTATGGA 
AGCTCACGCT 
TCGAGTGCGA 
GGGCTGTGTG 
CCCGACACAC 
GTAACTATCG 
C ATTG ATAGC 
GCAGCAGCCA 
CGTCGTCGGT 
TACAGAGTTC 
ATGTCTCAAG 
TATTTGGTAT 
ATAAACCATA 

GGTAGCTCTT 
CCATCGAGAA 



CAGCAAAAGG 
GTCGTTTTCC 
TAGGCTCCGC 
ATCCGAGGCG 
GGTGGCGAAA 
CCACCGCTTT 
AGCTCCCTCG 
TCGAGGGAGC 
GTCCGCCTTT 
CAGGCGGAAA 
GTAGGTATCT 
CATCCATAGA 

CACGAACCCC 
GTGCTTGGGG 
TCTTGAGTCC 
AGAACTCAGG 



TC AATTCTTG 
AGTTAAGAAC 
TATCCATCGC 
ATAGGTAGCG 
CCAGGAACCG 
GGTCCTTGGC 
CCCCCTGACG 
GGGGGACTGC 
CCCGACAGGA 
GGGCTGTCCT 

TGCGCTCTCC 
ACGCGAGAGG 

CTCCCTTCGG 
GAGGGAAGCC 

CAGTTCGGTG 
GTCAAGCCAC 

CCGTTCAGCC 
GGCAAGTCGG 
AACCCGGTAA 
TTGGGCCATT 



CTGGTAACAG 
GACCATTGTC 
TTGAAGTGGT 
AACTTCACCA 
CTGCGCTCTG 
GACGCGAGAC 
GATCCGGCAA 
CTAGGCCGTT 



TGTTTGCAAG CAGCAGATTA 
ACAAACGTTC GTCGTCTAAT 
CTTTGATCTT TTCTACGGGG 
GAAACTAGAA AAGATGCCCC 



TAAGGGATTT 

ATTCCCTAAA 

TTTAAATTAA 

AAATTTAATT 

CTTGGTCTGA 

GAACCAGACT 

ATCTGTCTAT 

TAGACAGATA 

AACTACGATA 

TTGATGCTAT 

CGCGAGACCC 

GCGCTCTGGG 

GCCGGAAGGG 

CGGCCTTCCC 



TGGTCATGAG 
ACCAGTACTC 
AAATGAAGTT 
TTTACTTCAA 
CAGTTACCAA 
GTCAATGGTT 
TTCGTTCATC 
AAGCAAGTAG 
CGGGAGGGCT 
GCCCTCCCGA 

ACGCTCACCG 
TGCGAGTGGC 
CCGAGCGCAG 
GGCTCGCGTC 



GATTAGCAGA 
CTAATCGTCT 
GGCCTAACTA 
C CGGATTG AT 

CTGAAGCCAG 
GACTTCGGTC 
ACAAACCACC 
TGTTTGGTGG 
CGCGCAGAAA 
GCGCGTCTTT 
TCTGACGCTC 
AGACTGCGAG 
ATTATCAAAA 
TAATAGTTTT 
TTAAATCAAT 
AATTTAGTTA 



TGCTTAATCA 
ACGAATTAGT 
CATAGTTGCC 
GTATCAACGG 
TACCATCTGG 
ATGGTAGACC 
GCTCCAGATT 
CGAGGTCTAA 
AAGTGGTCCT 
TTCACCAGGA 



CGGAGAACTG 
GCCTCTTGAC 

GTCCGCCATC 
CAGGCGGTAG 
TAAAAAGGCC 
ATTTTTC CGG 
AGCATCACAA 
TCGTAGTGTT 
CTATAAAGAT 
GATATTTCTA 

TGTTCCGACC 
ACAAGGCTGG 

GAAGCGTGGC 
CTTCGCACCG 
TAGGTCGTTC 
ATCCAGCAAG 
CGACCGCTGC 
GCTGGCGACG 
GACACGACTT 
CTGTGCTGAA 

GCGAGGTATG 
CGCTCCATAC 
CGGCTACACT 
GCCGATGTGA 
TTACCTTCGG 
AATGGAAGCC 
GCTGGTAGCG 
CGACCATCGC 
AAAAGGATCT 
TTTTCCTAGA 
AGTGGAACGA 
TCACCTTGCT 
AGGATCTTCA 
TCCTAGAAGT 
CTAAAGTATA 
GATTTCATAT 
GTGAGGCACC 
CACTCCGTGG 
TGACTCCCCG 
ACTGAGGGGC 



TGAATGCGCA 
ACTTACGCGT 

TCCAGCAGCC 
AGGTCGTCGG 

GCGTTGCTGG 
CGCAACGACC 
AAATCGACGC 
TTTAGCTGCG 
ACCAGGCGTT 
TGGTCCGCAA 
CTGCCGCTTA 
GACGGCGAAT 

GCTTTCTCAT 
CGAAAGAGTA 
GCTCCAAGCT 
CGAGGTTCGA 
GCCTTATCCG 
CGGAATAGGC 



CCCCAGTGCT 
GGGGTCACGA 
TATCAGCAAT 
ATAGTCGTTA 
GCAACTTTAT 
CGTTGAAATA 



ATCGCCACTG 
TAGCGGTGAC 
TAGGCGGTGC 
ATCCGCCACG 
AGAAGGACAG 
TCTTCCTGTC 

AAAAAGAGTT 
TTTTTCTCAA 
GTGGTTTTTT 
CACCAAAAAA 
CAAGAAGATC 
GTTCTTCTAG 
AAACTCACGT 
TTTGAGTGCA 
CCTAGATCCT 
GGATCTAGGA 
TATGAGTAAA 
ATACTCATTT 

TATCTCAGCG 
ATAGAGTCGC 
TC GTGT AG AT 
AGCACATCTA 
GCAATGATAC 
CGTTACTATG 
AAACCAGCCA 
TTTGGTCGGT 
CCGCCTCCAT 
GGCGGAGGTA 
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CCAGTCTATT 

GGTCAGATAA 

ATAGTTTGCG 

TATCAAACGC 

TCGTCGTTTG 

AGCAGCAAAC 

AGTTACATGA 

TCAATGTACT 

CTCCGATCGT 

GAGGCTAGCA 

ATGGCAGCAC 

TACCGTCGTG 

TTCTGTGACT 

AAGACACTGA 

GGCGACCGAG 

CCGCTGGCTC 

CATAGCAGAA 

GTATCGTCTT 

AAAACTCTCA 

TTTTGAGAGT 

CTCGTGCACC 

GAGCACGTGG 

GGGTGAGCAA 

CCCACTCGTT 

GACACGGAAA 

CTGTGCCTTT 

GCATTTATCA 

CGTAAATAGT 

TAGAAAAATA 

ATCTTTTTAT 

ACCTGACGTC 

TGGACTGCAG 

GGCGTATCAC 

CCGCATAGTG 

TATCATCGAT 

ATAGTAGCTA 

GCCCCAGTAG 

CGGGGTCATC 

CCGAGAATCC 

GGCTCTTAGG 

AGGCCTGGCT 

TCCGGACCGA 

TTTTCCGGGC 

AAAAGGCCCG 

TGGGGAATTT 

ACCCCTTAAA 



AATTGTTGCC 
TTAACAACGG 
CAACGTTGTT 
GTTGCAACAA 
GTATGGCTTC 
CATACCGAAG 
TCCCCCATGT 
AGGGGGTACA 
TGTCAGAAGT 
ACAGTCTTCA 
TGCATAATTC 
AC GT ATTAAG 
GGTGAGTACT 
CCACTCATGA 
TTGCTCTTGC 
AACGAGAACG 
CTTTAAAAGT 
GAAATTTTCA 
AGGATCTTAC 
TCCTAGAATG 
CAACTGATCT 
GTTGACTAGA 
AAACAGGAAG 
TTTGTCCTTC 
TGTTGAATAC 
ACAACTTATG 
GGGTTATTGT 
CCCAATAACA 
AACAAATAGG 
TTGTTTATCC 
TAAGAAACCA 
ATTCTTTGGT 

GAGGCCCTTT 
CTCCGGGAAA 
AAGCTGATCC 
TTCGACTAGG 
CATCTCTGTC 
GTAGAGACAG 
CCATCCCTAC 
GGTAGGGATG 
TGAGGCTCAG 
ACTCCGAGTC 
TGCGAGTAAT 
ACGCTCATTA 
TCAGACCTGG 
AGTCTGGACC 



GGGAAGCTAG 
CCCTTCGATC 
GCCATTGCTG 
CGGTAACGAC 
ATTCAGCTCC 
TAAGTCGAGG 
TGTGCAAAAA 
ACACGTTTTT 

AAGTTGGCCG 
TTCAACCGGC 

TCTTACTGTC 

AGAATGACAG 

CAACCAAGTC 

GTTGGTTC AG 

CCGGCGTCAA 

GGCCGCAGTT 

GCTCATCATT 

CGAGTAGTAA 

CGCTGTTGAG 

GCGACAACTC 

TCAGCATCTT 

AGTCGTAGAA 

GCAAAATGCC 

CGTTTTACGG 

TCATACTCTT 

AGTATGAGAA 

CTCATGAGCG 

GAGTACTCGC 

GGTTCCGCGC 

CCAAGGCGCG 

TTATTATCAT 

AATAATAGTA 

CGTCTTCAAG 

GCAGAAGTTC 

TCACAGGCCG 

AGTGTCCGGC 

TGGTGACCTT 

ACCACTGGAA 

CGTCCAGCAA 

GCAGGTCGTT 

GACGCAAATC 

CTGCGTTTAG 

TGGTGATGAG 

ACCACTACTC 

ATCTGTCTGA 

TAGACAGACT 



AGTAAGTAGT 
TCATTCATCA 
CAGGCATCGT 
GTCCGTAGCA 

GGTTCCCAAC 
CCAAGGGTTG 
AGCGGTTAGC 
TCGCCAATCG 
CAGTGTTATC 
GTCACAATAG 
ATGCCATCCG 
TACGGTAGGC 
ATTCTGAGAA 
TAAGACTCTT 
CACGGGATAA 
GTGCCCTATT 

GGAAAACGTT 
CCTTTTGCAA 

ATCCAGTTCG 
TAGGTCAAGC 
TTACTTTCAC 
AATGAAAGTG 
GCAAAAAAGG 
CGTTTTTTCC 
CCTTTTTCAA 
GGAAAAAGTT 
GATACATATT 
CTATGTATAA 
ACATTTCCCC 
TGTAAAGGGG 
GACATTAACC 
CTGTAATTGG 
AATTCTCATG 
TTAAGAGTAC 
CACCCAGCTT 
GTGGGTCGAA 
GAAGAGGAAG 
CTTCTCCTTC 
AAAGGGGGAC 
TTTCCCCCTG 
TTGAGGATGT 
AACTCCTACA 
GACGAGGATG 
CTGCTCCTAC 
CAGCGACCAT 
GTCGCTGGTA 



TCGCCAGTTA 
AGCGGTCAAT 

GGTGTCACGC 
CCACAGTGCG 
GATCAAGGCG 
CTAGTTCCGC 
TCCTTCGGTC 
AGGAAGCCAG 
ACTCATGGTT 
TGAGTACCAA 
TAAGATGCTT 
ATTCTACGAA 

TAGTGTATGC 
ATCACATACG 
TACCGCGCCA 
ATGGCGCGGT 
CTTCGGGGCG 
GAAGCCCCGC 
ATGTAACCCA 
TACATTGGGT 
CAGCGTTTCT 
GTCGCAAAGA 
GAATAAGGGC 
CTTATTCCCG 
TATTATTGAA 
ATAATAACTT 
TGAATGTATT 
ACTTACATAA 
GAAAAGTGCC 
CTTTTCACGG 
TATAAAAATA 
ATATTTTTAT 
TTTGACAGCT 
AAACTGTCGA 
TTCTTCCGTT 
AAGAAGGCAA 
AGGAGGGGTC 
TCCTCCCCAG 
GAGGAATTTG 
CTCCTTAAAC 

TCAGCGGGAG 
AGTCGCCCTC 
GTTCGGAGGA 
CAAGCCTCCT 
GAAGGGGATG 
CTTCCCCTAC 
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63 51 AGGGTGGGGG 
TCCCACCCCC 

6401 CTGAGCGTCG 
GACTCGCAGC 

6451 TGTCTGACGA 
ACAGACTGCT 

6501 GGAGACACAT 
CCTCTGTGTA 

6551 AGGGGGTGAT 
TCCCCCACTA 

6601 GCGGAAGACC 
CGCCTTCTGG 

6651 GATGGTGTCC 
CTACCACAGG 

6701 GACCCACGGT 
CTGGGTGCCA 

67 51 GAGGGGCAGG 
CTCCCCGTCC 

6801 GCAGG AGGGG 
CGTCCTCCCC 

6851 AGGGGC AGG A 
TCCCCGTCCT 

6901 G AGG AGGGGC 
CTCCTCCCCG 

6951 GGGGCAGGAG 

CCCCGTCCTC 
7001 AGGAGCAGGA 

TCCTCGTCCT 
7 051 GGGCAGGAGC 

CCCGTCCTCG 
7101 GCAGGAGCAG 

CGTCCTCGTC 
7151 GGCAGGAGCA 

CCGTCCTCGT 

7201 GAGGGGCAGG 
CTCCCCGTCC 

7251 GGGGCAGGAG 
CCCCGTCCTC 

7301 AGG AGGGGC A 
TCCTCCCCGT 

7351 GAGGGGCAGG 
CTCCCCGTCC 

7401 GGAGCAGGAG 
CCTCGTCCTC 

7451 TGGAGGCCGG 
ACCTCCGGCC 



GGCTGTTGGA 
CCGACAACCT 

TCTAATAAAG 
AGATTATTTC 
GGGGCCAGGT 
CCCCGGTCCA 
CTGGACCAGA 
GACCTGGTCT 

AACCATGGAC 
TTGGTACCTG 
AGGAGCCCCG 
TCCTCGGGGC 
GGAGACCCCA 
CCTCTGGGGT 

GGAACAGGAG 
CCTTGTCCTC 
AGCAGGAGGA 
TCGTCCTCCT 
CAGGAGGGGC 
GTCCTCCCCG 

GGGGCAGGAG 
CCCCGTCCTC 
AGGAGGGGCA 
TCCTCCCCGT 
CAGGAGGAGG 
GTCCTCCTCC 
GGAGGGGCAG 
CCTCCCCGTC 
AGGAGGGGCA 
TCCTCCCCGT 
GAGGAGGGGC 
CTCCTCCCCG 
GGAGGGGCAG 
CCTCCCCGTC 
AGCAGGAGGG 
TCGTCCTCCC 
CAGGAGGAGG 
GTCCTCCTCC 
GGAGCAGGAG 
CCTCGTCCTC 
AGCAGGAGGG 
TCGTCCTCCC 
GGGCAGGAGC 
CCCGTCCTCG 
GGTCGAGGAG 
CCAGCTCCTC 



GGGGGCAGGA 
CCCCCGTCCT 
ATGTCTATTG 
TACAGATAAC 
ACAGGACCTG 
TGTCCTGGAC 
AGGCTCCGGC 
TCCGAGGCCG 
GAGGACGGGG 
CTCCTGCCCC 
GGCGGCTCAG 
CCGCCGAGTC 
AAAACGTCCA 
TTTTGCAGGT 
CAGGAGCAGG 
GTCCTCGTCC 
GGGGCAGGAG 
CCCCGTCCTC 
AGGAGCAGGA 
TCCTCGTCCT 
GGGCAGGAGC 
CCCGTCCTCG 
GGAGCAGGAG 
CCTCGTCCTC 
GGCAGGAGCA 
CCGTCCTCGT 

GAGGGGCAGG 
CTCCCCGTCC 
GGAGGGGCAG 
CCTCCCCGTC 
AGGAGGGGCA 
TCCTCCCCGT 

GAGCAGGAGG 
CTCGTCCTCC 
GCAGGAGGGG 
CGTCCTCCCC 
GGCAGGAGGG 
CCGTCCTCCC 
GGGCAGGAGG 
CCCGTCCTCG 

GCAGGAGGGG 
CGTCCTCCCC 
AGGAGGTGGA 
TCCTCCACCT 

GTAGTGGAGG 
CATCACCTCC 



GTCTGCACTC 
CAGACGTGAG 
ATCTCTTTTA 
TAGAGAAAAT 
GAAATGGCCT 
CTTTACCGGA 
GGCAGTGGAC 
CCGTCACCTG 
AAGAGGACGA 
TTCTCCTGCT 
GATCAGGGCC 
CTAGTCCCGG 
AGTTGCATTG 
TCAACGTAAC 
AGCGGGAGGG 
TCGCCCTCCC 
CAGGAGGAGG 
GTCCTCCTCC 
GGAGGGGCAG 
CCTCCCCGTC 

AGGAGGAGGG 
TCCTCCTCCC 
GAGGGGCAGG 
CTCCCCGTCC 

GGAGGAGGGG 
CCTCCTCCCC 
AGGGGCAGGA 
TCCCCGTCCT 
GAGGGGCAGG 
CTCCCCGTCC 
GGAGGGGCAG 
CCTCCCCGTC 
GGCAGGAGCA 
CCGTCCTCGT 
CAGGAGCAGG 
GTCCTCGTCC 
GCAGGAGCAG 
CGTCCTCGTC 
GGCAGGAGCA 
CCGTCCTCGT 
CAGGAGCAGG 
GTCCTCGTCC 
GGCCGGGGTC 
CCGGCCCCAG 
CCGGGGTCGA 
GGCCCCAGCT 



CCTGTATTCA 
GGACATAAGT 
GTGTGAATCA 
CACACTTAGT 
AGGAGAGAAG 
TCCTCTCTTC 
CTCAAAGAAG 
GAGTTTCTTC 
GGACGAGGAG 
CCTGCTCCTC 
AAGACATAGA 
TTCTGTATCT 
GCTGCAAAGG 
CGACGTTTCC 
GCAGGAGCAG 
CGTCCTCGTC 
GGCAGGAGGG 
CCGTCCTCCC 
GAGCAGGAGG 
CTCGTCCTCC 
GCAGGAGCAG 
CGTCCTCGTC 
AGGGGCAGGA 
TCCCCGTCCT 
CAGGAGGGGC 
GTCCTCCCCG 
GCAGGAGGAG 
CGTCCTCCTC 
AGCAGGAGGG 
TCGTCCTCCC 
GAGCAGGAGG 
CTCGTCCTCC 
GGAGGGGCAG 
CCTCCCCGTC 
AGGGGCAGGA 
TCCCCGTCCT 

GAGGAGGGGC 
CTCCTCCCCG 
GGAGGGGCAG 
CCTCCCCGTC 
AGGAGGGGCA 
TCCTCCCCGT 
GAGGAGGCAG 
CTCCTCCGTC 
GGAGGTAGTG 
CCTCCATCAC 
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7 501 GAGGCCGCCG GGGTAGAGGA CGTGAAAGAG CCAGGGGGGG AAGTCGTGAA 
CTCCGGCGGC CCCATCTCCT GCACTTTCTC GGTCCCCCCC TTCAGCACTT 
7 551 AGAGCCAGGG GGAGAGGTCG TGGACGTGGA GAAAAGAGGC CCAGGAGTCC 
TCTCGGTCCC CCTCTCCAGC ACCTGCACCT CTTTTCTCCG GGTCCTCAGG 
7 601 CAGTAGTCAG TCATCATCAT CCGGGTCTCC ACCGCGCAGG CCCCCTCCAG 
GTCATCAGTC AGTAGTAGTA GGCCCAGAGG TGGCGCGTCC GGGGGAGGTC 
7 651 GTAGAAGGCC ATTTTTCCAC CCTGTAGGGG AAGCCGATTA TTTTGAATAC 
CATCTTCCGG TAAAAAGGTG GGACATCCCC TTCGGCTAAT AAAACTTATG 
7701 CACCAAGAAG GTGGCCCAGA TGGTGAGCCT GACGTGCCCC CGGGAGCGAT 
GTGGTTCTTC CACCGGGTCT ACCACTCGGA CTGCACGGGG GCCCTCGCTA 
7751 AGAGCAGGGC CCCGCAGATG ACCCAGGAGA AGGCCCAAGC ACTGGACCCC 
TCTCGTCCCG GGGCGTCTAC TGGGTCCTCT TCCGGGTTCG TGACCTGGGG 
7801 GGGGTCAGGG TGATGGAGGC AGGCGCAAAA AAGGAGGGTG GTTTGGAAAG 
CCCCAGTCCC ACTACCTCCG TCCGCGTTTT TTCCTCCCAC CAAACCTTTC 
7851 CATCGTGGTC AAGGAGGTTC CAACCCGAAA TTTGAGAACA TTGCAGAAGG 
GTAGCACCAG TTCCTCCAAG GTTGGGCTTT AAACTCTTGT AACGTCTTCC 
7901 TTTAAGAGCT CTCCTGGCTA GGAGTCACGT AGAAAGGACT ACCGACGAAG 
AAATTCTCGA GAGGACCGAT CCTCAGTGCA TCTTTCCTGA TGGCTGCTTC 
7951 GAACTTGGGT CGCCGGTGTG TTCGTATATG GAGGTAGTAA GACCTCCCTT 
CTTGAACCCA GCGGCCACAC AAGCATATAC CTCCATCATT CTGGAGGGAA 
8001 TACAACCTAA GGCGAGGAAC TGCCCTTGCT ATTCCACAAT GTCGTCTTAC 
ATGTTGGATT CCGCTCCTTG ACGGGAACGA TAAGGTGTTA CAGCAGAATG 
8051 ACCATTGAGT CGTCTCCCCT TTGGAATGGC CCCTGGACCC GGCCCACAAC 
TGGTAACTCA GCAGAGGGGA AACCTTACCG GGGACCTGGG CCGGGTGTTG 
8101 CTGGCCCGCT AAGGGAGTCC ATTGTCTGTT ATTTCATGGT CTTTTTACAA 
GACCGGGCGA TTCCCTCAGG TAACAGACAA TAAAGTACCA GAAAAATGTT 
8151 ACTCATATAT TTGCTGAGGT TTTGAAGGAT GCGATTAAGG ACCTTGTTAT 
TGAGTATATA AACGACTCCA AAACTTC CT A CGCTAATTCC TGGAACAATA 
8201 GACAAAGCCC GCTCCTACCT GCAATATCAG GGTGACTGTG TGCAGCTTTG 
CTGTTTCGGG CGAGGATGGA CGTTATAGTC CCACTGACAC ACGTCGAAAC 
8251 ACGATGGAGT AGATTTGCCT CCCTGGTTTC CACCTATGGT GGAAGGGGCT 
TGCTACCTCA TCTAAACGGA GGGACCAAAG GTGGATACCA CCTTCCCCGA 
8301 GCCGCGGAGG GTGATGACGG AGATGACGGA GATGAAGGAG GTGATGGAGA 
CGGCGCCTCC CACTACTGCC TCTACTGCCT CTACTTCCTC CACTACCTCT 
8351 TGAGGGTGAG GAAGGGCAGG AGTGATGTAA CTTGTTAGGA GACGCCCTCA 
ACTCCCACTC CTTCCCGTCC TCACTACATT GAACAATCCT CTGCGGGAGT 
.8401 ATCGTATTAA AAGCCGTGTA TTCCCCCGCA CTAAAGAATA AATCCCCAGT 
TAGCATAATT TTCGGCACAT AAGGGGGCGT GATTTCTTAT TTAGGGGTCA 
8451 AGACATCATG CGTGCTGTTG GTGTATTTCT GGCCATCTGT CTTGTCACCA 
TCTGTAGTAC GCACGACAAC CACATAAAGA CCGGTAGACA GAACAGTGGT 
8501 TTTTCGTCCT CCCAACATGG GGCAATTGGG CATACCCATG TTGTCACGTC 
AAAAGCAGGA GGGTTGT AC C CCGTTAACCC GTATGGGTAC AACAGTGCAG 
8551 ACTCAGCTCC GCGCTCAACA CCTTCTCGCG TTGGAAAACA TTAGCGACAT 
TGAGTCGAGG CGCGAGTTGT GGAAGAGCGC AACCTTTTGT AATCGCTGTA 
8601 TTACCTGGTG AGCAATCAGA CATGCGACGG CTTTAGCCTG GCCTCCTTAA 
AATGG AC C AC TCGTTAGTCT GTACGCTGCC GAAATCGGAC CGGAGGAATT 
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8651 ATTCACCTAA GAATGGGAGC 
TAAGTGGATT CTTACCCTCG 
8701 AAAATTCACG CCCCCTTGGG 
TTTTAAGTGC GGGGGAACCC 
8751 CCACTCTACT ACTGGGTATC 
GGTGAGATGA TGACCCATAG 
8801 ATATGCTACC CGGATACAGA 
TATACGATGG GCCTATGTCT 
8851 TTAGGATAGC ATATGCTACC 
AATCCTATCG TATACGATGG 
8901 CAGATATAAA TTAGGATAGC 
GTCTATATTT AATCCTATCG 
8951 ATATGCTACC CAGATATAGA 
TATACGATGG GTCTATATCT 
9 001 TTAGGATAGC ATATGCTACC 
AATCCTATCG TATACGATGG 
9051 CAGATATTTG GGTAGTATAT 
GTCTATAAAC CCATCATATA 
9101 ACTACCCTAA TCTCTATTAG 
TGATGGGATT AGAGATAATC 
9151 GATAGCATAT ACTACCCAGA 
CTATCGTATA TGATGGGTCT 
9201 TATAGATTAG GATAGCCTAT 
ATATCTAATC CTATC GGATA 
9251 ACTACCCAGA TATAGATTAG 
TGATGGGTCT ATATCTAATC 
9301 GATAGCCTAT GCTACCCAGA 
CTATCGGATA CGATGGGTCT 
9351 TATTTGGGTA GTATATGCTA 
ATAAACCCAT CATATACGAT 
9401 AGCGACCTCG TGAATATGAG 
TCGCTGGAGC ACTTATACTC 
9451 GCAAGTGTGT GTAATTTGTC 
CGTTCACACA CATTAAACAG 
9501 TGGCCCGCCC ACCTACTTAT 
ACCGGGCGGG TGGATGAATA 
9551 TTTGTGGGCA AGTGGTTTGA 
AAACACCCGT TCACCAAACT 
9601 AAGTTATTAC ACCCTTATTT 
TTCAATAATG TGGGAATAAA 
9651 GGGCTGACGC GTGCCCCCAC 
CCCGACTGCG CACGGGGGTG 
97 01 TGTCTTTGTT TATGGGCCCC 
AC AG AAAC AA ATACCCGGGG 
97 51 GGTGTTAGAG ACAACCAGTG 
CCACAATCTC TGTTGGTCAC 



AACCAGCATG CAGGAAAAGG 
TTGGTCGTAC GTCCTTTTCC 
AGGTGGCGGC ATATGCAAAG 
TCCACCGCCG TATACGTTTC 
ATATGCTGAC TGTATATGCA 
TATACGACTG ACATATACGT 
TTAGGATAGC ATATACTACC 
AATCCTATCG TATATGATGG 
CAGATATAGA TTAGGATAGC 
GTCTATATCT AATCCTATCG 
ATATACTACC CAGATATAGA 
TATATGATGG GTCTATATCT 
TTAGGATAGC CTATGCTACC 
AATCCTATCG GATACGATGG 
CAGATATAGA TTAGGATAGC 
GTCTATATCT AATCCTATCG 
GCTACCCAGA TATAAATTAG 
CGATGGGTCT ATATTTAATC 
GATAGCATAT GCTACCCGGA 
CTATCGTATA CGATGGGCCT 
TATAGATTAG GATAGCATAT 
ATATCTAATC CTATCGTATA 
GCTACCCAGA TATAAATTAG 
CGATGGGTCT ATATTTAATC 
GATAGCATAT GCTACCCAGA 
CTATCGTATA CGATGGGTCT 
TATAGATTAG GATAGCATAT 
ATATCTAATC CTATCGTATA 
CCCATGGCAA CATTAGCCCA 
GGGTACCGTT GTAATCGGGT 
GACCAACAAC CCTGTGCTTG 
CTGGTTGTTG GGACACGAAC 
CTCCAGATCG CAGCAATCGC 
GAGGTCTAGC GTCGTTAGCG 
GCAGGTATTC CCCGGGGTGC 
CGTCCATAAG GGGCCCCACG 
CCGCAGTGGT TAGCGGGGTT 
GGCGTCACCA ATCGCCCCAA 
TACAGTCCAA AACCGCAGGG 
ATGTCAGGTT TTGGCGTCCC 
TCCACAATTT CAAAAAAAAG 
AGGTGTTAAA GTTTTTTTTC 
ATTGGCGTGG AGCCCCGTTT 
TAACCGCACC TCGGGGCAAA 
GAGTCCGCTG CTGTCGGCGT 
CTCAGGCGAC GACAGCCGCA 



ACAAGCAGCG 
TGTTCGTCGC 

GATAGCACTC 
CTATC GTG AG 

TGAGGATAGC 
ACTCCTATCG 
CAGATATAGA 
GTCTATATCT 
CTATGCTACC 
GATACGATGG 

TTAGGATAGC 
AATCCTATCG 
CAGATATAGA 
GTCTATATCT 
AT ATG CTATC 
TATACGATAG 
GATAGCATAT 
CTATCGTATA 
TACAGATTAG 
ATGTCTAATC 
GCTACCCAGA 
CGATGGGTCT 
GATAGCATAT 
CTATCGTATA 
TATAGATTAG 
ATATCTAATC 
GCTATCCAGA 
CGATAGGTCT 

CCGTGCTCTC 
GGCACGAGAG 
GCGCTCAGGC 
CGCGAGTCCG 
GCCCCTATCT 
CGGGGATAGA 
CATTAGTGGT 
GTAATCACCA 
ACAATCAGCC 
TGTTAGTCGG 
CGGCGTGTGG 
GCCGCACACC 

AGTGGC C AC T 
TCACCGGTGA 
AATTTTCGGG 
TTAAAAGCCC 
CCACTCTCTT 
GGTGAGAGAA 
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9801 TCCCCTTGTT ACAAATAGAG 

AGGGGAACAA TGTTTATCTC 
9851 CTGCCTGGGA CACATCTTAA 

GACGGACCCT GTGTAGAATT 
9901 TGTGTTGCCC ATAGCCATAA 

ACACAACGGG TATCGGTATT 
9951 GGCTTGTCCC CACCCCATGG 

CCGAACAGGG GTGGGGTACC 
10001 TCATTCCTAC ACTAGTATTT 

AGTAAGGATG TGATCATAAA 
10051 GGTGTCATAG CACAATGCCA 

CCACAGTATC GTGTTACGGT 
10101 GGGGGCGTCA CCTGAAACCT 

CCCCCGCAGT GGACTTTGGA 
10151 GTTCACAACT CAGCAGTTAT 

CAAGTGTTGA GTCGTC AAT A 
10201 GCAGGCGAAG ATTCAGGAGA 

CGTCCGCTTC TAAGTCCTCT 
10251 CTGCCCTTGT GACTAAAATG 

GACGGGAACA CTGATTTTAC 
10301 TGTAAATAAA ACCGTGACAG 

ACATTTATTT TGGCACTGTC 
10351 AGGACCCTTT TACTAACCCT 

TCCTGGGAAA ATG ATTGGG A 
10401 ACATATGCTA TTGAATTAGG 

TGTATACGAT AACTTAATCC 
10451 GGAAGCATAT GCTACCCGTT 

CCTTCGTATA CGATGGGCAA 
10501 ATTGCTAATG CCCTCTTGAG 

TAACGATTAC GGGAGAACTC 
10551 TCTGATTGAC GTTGGTGTAG 

AGACTAACTG CAACCACATC 
10601 GTACATGTCC CCCAGCATTG 

CATGTACAGG GGGTCGTAAC 
10651 AAAGGCAATG TTGTGTTGCA 

TTTCCGTTAC AACACAACGT 
10701 GAAAGCCACT CAGTGTTGGC 

CTTTCGGTGA GTCACAACCG 
10751 ACTACAGTCA GAGAACCCCT 

TGATGTCAGT CTCTTGGGGA 
10801 GG AACAGGGC CCAGTTGGCA 

CCTTGTCCCG GGTCAACCGT 
10851 GCACTGCCCC GCGAAGAAGG 

CGTGACGGGG CGCTTCTTCC 
10901 TCCGGCGGCG GGGCTCTAGA 
AGGCCGCCGC CCCGAGATCT 
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TGTAACAACA TGGTTCACCT GTCTTGGTCC 
ACATTGTTGT ACCAAGTGGA CAGAACCAGG 
TAACCCCAGT ATCATATTGC ACTAGGATTA 
ATTGGGGTC A TAGTATAACG TGATCCTAAT 
ATTCGTGTGA GATGGACATC CAGTCTTTAC 
TAAGCACACT CTACCTGTAG GTCAGAAATG 
ATTTCTATTG TTAAAGATAT TCAGAATGTT 
TAAAGATAAC AATTTCTATA AGTCTTACAA 
ATTGCCCAAG GGGTTTGTGA GGGTTATATT 
TAACGGGTTC CCCAAACACT CCCAATATAA 
CCACTGAACC CCCCGTCCAA ATTTTATTCT 
GGTGACTTGG GGGGCAGGTT TAAAATAAGA 
TGTTTTCGAG CACCTCACAT ACACCTTACT 
AC AAAAGCTC GTGGAGTGTA TGTGGAATGA 
TCTATTAGCT AAACGAAGGA GAATGAAGAA 
AGATAATCGA TTTGCTTCCT CTTACTTCTT 
GTTCACTGCC CGCTCCTTGA TCTTCAGCCA 
CAAGTGACGG GCGAGGAACT AGAAGTCGGT 
GTTCACTACC CTCGTGGAAT CCTGACCCCA 
CAAGTGATGG GAGCACCTTA GGACTGGGGT 
CTCATGGGGT GGGAGATATC GCTGTTCCTT 
GAGTACCCCA CCCTCTATAG CGACAAGGAA 
AATTCGATAG CATATGCTTC CCGTTGGGTA 
TTAAGCTATC GTATACGAAG GGCAACCCAT 
GTTAGTCTGG ATAGTATATA CTACTACCCG 
CAATCAGACC TATCATATAT GATGATGGGC 
TAGGGTTAAC AAGGGGGCCT TATAAACACT 
ATCCCAATTG TTCCCCCGGA ATATTTGTGA 
GGTCCGCTTA TCGGTAGCTA CACAGGCCCC 
CCAGGCGAAT AGCCATCGAT GTGTCCGGGG 
CCTCCCGTAG TCTTCCTGGG CCCCTGGGAG 
GGAGGGCATC AG AAGG AC CC GGGGACCCTC 
GTGTAAGAGC TTCAGCCAAG AGTTACACAT 
CACATTCTCG AAGTC GGTTC TCAATGTGTA 
GTCCACAGAC TGCAAAGTCT GCTCCAGGAT 
CAGGTGTCTG ACGTTTCAGA CGAGGTCCTA 
AAATGTGCAC ATCCATTTAT AAGGATGTCA 
TTTACACGTG TAGGTAAATA TTCCTACAGT 
TTGTGTTTGG TCCCCCCCCG TGTCACATGT 
AACACAAACC AGGGGGGGGC ACAGTGTACA 
AGTTGTACCA ACCAACTGAA GGGATTACAT 
TCAACATGGT TGGTTG AC TT CCCTAATGTA 
GGCAGAGATG CCGTAGTCAG GTTTAGTTCG 
CCGTCTCTAC GGCATCAGTC CAAATCAAGC 
GTCGACCGGT CATGGCTGCG CCCCGACACC 
CAGCTGGCCA GTACCGACGC GGGGCTGTGG 
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10951 CGCCAACACC CGCTGACGCG CCCTGACGGG CTTGTCTGCT CCCGGCATCC 

GCGGTTGTGG GCGACTGCGC GGGACTGCCC GAACAGACGA GGGCCGTAGG 
11001 GCTTACAGAC AAGCTGTGAC CGTCTCCGGG AGCTGCATGT GTCAGAGGTT 

CGAATGTCTG TTCGACACTG GCAGAGGCCC TCGACGTACA CAGTCTCCAA 
11051 TTCACCGTCA TCACCGAAAC GCGCGAGGCA GCCGGATCAT AATCAGCCAT 

AAGTGGCAGT AGTGGCTTTG CGCGCTCCGT CGGCCTAGTA TTAGTCGGTA 
11101 ACCACATTTG TAGAGGTTTT ACTTGCTTTA AAAAACCTCC CCACCTCCCC 

TGGTGTAAAC ATCTCCAAAA TGAACGAAAT TTTTTGGAGG GGTGGAGGGG 
11151 CTGAACCTGA AAC AT AAAAT GAATGCAATT GTTGTTGTTA ACTTGTTTAT 

GACTTGGACT TTGTATTTTA CTTACGTTAA CAACAACAAT TGAACAAATA 
11201 TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA 

ACGTCGAATA TTACCAATGT TTATTTCGTT ATCGTAGTGT TTAAAGTGTT 
11251 ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC C AAACTC ATC 

TATTTCGTAA AAAAAGTGAC GTAAGATCAA CACCAAACAG GTTTGAGTAG 
11301 AATGTATCTT ATCATGTCTG GATCCCACGT GCAGGCGGGG AGGCGGCCCA 

TTACATAGAA TAGTACAGAC CTAGGGTGCA CGTCCGCCCC TCCGCCGGGT 

113 51 AAGGGAGATC CGACTCGTCT GAGGGCGAAG GCGAAGACGC GGAAGAGGCC 

TTCCCTCTAG GCTGAGCAGA CTCCCGCTTC CGCTTCTGCG CCTTCTCCGG 
11401 GCAGAGCCGG CAGCAGGCCG CGGGAAGGAA GGTCCGCTGG ATTGAGGGCC 
CGTCTCGGCC GTCGTCCGGC GCCCTTCCTT CCAGGCGACC TAACTCCCGG 

114 51 GAAGGGACGT AGCAGAAGGA CGTCCCGCGC AGAATCCAGG TGGCAACACA 

CTTCCCTGCA TCGTCTTCCT GCAGGGCGCG TCTTAGGTCC ACCGTTGTGT 
11501 GGCGAGCAGC CAAGGAAAGG ACGATGATTT CCCCGACAAC ACCACGGAAT 

CCGCTCGTCG GTTCCTTTCC TGCTACTAAA GGGGCTGTTG TGGTGCCTTA 
11551 TGTCAGTGCC CAACAGCCGA GCCCCTGTCC AGCAGCGGGC AAGGCAGGCG 

ACAGTCACGG GTTGTCGGC T CGGGGACAGG TCGTCGCCCG TTCCGTCCGC 
11601 GCGATGAGTT CCGCCGTGGC AATAGGGAGG GGGAAAGCGA AAGTCCCGGA 

CGCTACTCAA GGCGGCACCG TTATCCCTCC CCCTTTCGCT TTCAGGGCCT 
11651 AAGGAGCTGA CAGGTGGTGG CAATGCCCCA ACCAGTGGGG GTTGCGTCAG 

TTCCTCGACT GTCCACCACC GTTACGGGGT TGGTCACCCC CAACGCAGTC 
117 01 CAAACACAGT GCACACCACG CCACGTTGCC TGACAACGGG CCACAACTCC 

GTTTGTGTCA CGTGTGGTGC GGTGCAACGG ACTGTTGCCC GGTGTTGAGG 
117 51 TCATAAAGAG ACAGCAACCA GGATTTATAC AAGGAGGAGA AAATGAAAGC 

AGTATTTCTC TGTCGTTGGT CCTAAATATG TTCCTCCTCT TTTACTTTCG 
11801 CATACGGGAA GCAATAGCAT GATACAAAGG CATTAAAGCA GCGTATCCAC 

GTATGCCCTT CGTTATCGTA CTATGTTTCC GTAATTTCGT CGCATAGGTG 
11851 ATAGCGTAAA AGGAGCAACA TAGTTAAGAA TACCAGTCAA TCTTTCACAA 

TATCGCATTT TCCTCGTTGT ATCAATTCTT ATGGTCAGTT AGAAAGTGTT 
11901 ATTTTGTAAT CCAGAGGTTG ATTC 

TAAAACATTA GGTCTCCAAC TAAG 
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